Introduction
1. Getting Started
- 1.1. Multi-Model Database
- 1.2. Installation
- 1.3. Run the server
- 1.4. Run the console
- 1.5. Classes
- 1.6. Clusters
- 1.7. Record ID
- 1.8. SQL
- 1.9. Relationships
- 1.10. Working with Graphs
- 1.11. Using Schema with Graphs
- 1.12. Setup a Distributed Database
- 1.13. Working with Distributed Graphs
- 1.14. Java API
- 1.15. More on Tutorials
  - 1.15.1. Presentations
2. Basic Concepts
- 2.1. Supported Types
- 2.2. Inheritance
- 2.3. Schema
- 2.4. Cluster Selection
3. Fetching Strategies
4. Indexes
- 4.1. SB-Tree
- 4.2. Hash
- 4.3. Full Text
- 4.4. Lucene Full Text
- 4.5. Lucene Spatial
5. Security
- 5.1. SSL
6. Caching
7. Functions
8. Transaction
9. Hook - Triggers
- 9.1. Dynamic Hooks
- 9.2. Java (Native) Hooks
10. API
- 10.1. Graph or Document API?
- 10.2. SQL
- 10.3. Java API
- 10.4. Gremlin API
- 10.5. Javascript
  - 10.5.1. Javascript API
- 10.6. Scala API
- 10.7. HTTP API
- 10.8. Binary Protocol
11. Use Cases
- 11.1. Time Series
- 11.2. Key Value
12. Server
- 12.1. Embed the Server
- 12.2. Plugins
13. Studio
- 13.1. Query
- 13.2. Edit Document
- 13.3. Edit Vertex
- 13.4. Schema
- 13.5. Class
- 13.6. Graph Editor
- 13.7. Functions
- 13.8. Security
- 13.9. Database Management
- 13.10. Server Management
14. Console
- 14.1. Backup
- 14.2. Begin
- 14.3. Browse Class
- 14.4. Browse Cluster
- 14.5. Classes
- 14.6. Clusters
- 14.7. Commit
- 14.8. Config
- 14.9. Config Get
- 14.10. Config Set
- 14.11. Connect
- 14.12. Create Cluster
- 14.13. Create Database
- 14.14. Create Index
- 14.15. Create Link
- 14.16. Create Property
- 14.17. Declare Intent
- 14.18. Delete
- 14.19. Dictionary Get
- 14.20. Dictionary Keys
- 14.21. Dictionary Put
- 14.22. Dictionary Remove
- 14.23. Disconnect
- 14.24. Display Record
- 14.25. Drop Cluster
- 14.26. Drop Database
- 14.27. Export
- 14.28. Export Record
- 14.29. Freeze DB
- 14.30. Get
- 14.31. Grant
- 14.32. Import
- 14.33. Info
- 14.34. Info Class
- 14.35. Insert
- 14.36. Load Record
- 14.37. Profiler
- 14.38. Properties
- 14.39. Release DB
- 14.40. Reload Record
- 14.41. Restore
- 14.42. Revoke
- 14.43. Rollback
- 14.44. Set
15. Operations
- 15.1. Installation
- 15.2. Performance Tuning
- 15.3. ETL
- 15.4. Distributed Architecture
- 15.5. Backup and Restore
- 15.6. Export and Import
- 15.7. Logging
16. Enterprise Edition
17. Troubleshooting
- 17.1. Java
18. Available Plugins
- 18.1. Rexster
- 18.2. Gephi Graph Render
19. Upgrade
- 19.1. Backward compatibility
- 19.2. From 1.7.x to 2.0.x
- 19.3. From 1.6.x to 1.7.x
- 19.4. From 1.5.x to 1.6.x
- 19.5. From 1.4.x to 1.5.x
- 19.6. From 1.3.x to 1.4.x
20. Internals
- 20.1. Storages
- 20.2. Clusters
- 20.3. Limits
- 20.4. RidBag
- 20.5. SQL Syntax
- 20.6. Custom Index Engine
21. Contribute to OrientDB
- 21.1. The Team
- 21.2. Hackaton
- 21.3. Report an issue
22. Get in touch
Published using GitBook

OrientDB Manual

Clusters

We've already talked about classes. A class is a logical concept in OrientDB. Clusters are also an important concept in OrientDB. Records (or documents/vertices) are stored in clusters.

What is a cluster?

A cluster is a place where a group of records are stored. Perhaps the best equivalent in the relational world would be a Table. By default, OrientDB will create one cluster per class. All the records of a class are stored in the same cluster which has the same name as the class. You can create up to 32,767 (2^15-1) clusters in a database.

Understanding the concepts of classes and clusters allows you to take advantage of the power of clusters while designing your new database.

Even though the default strategy is that each class maps to one cluster, a class can rely on multiple clusters. You can spawn records physically in multiple places, thereby creating multiple clusters. For example:

Class-Custer

The class "Customer" relies on 2 clusters:

USA_customers, containing all USA customers. This is the default cluster as denoted by the red star.
China_customers, containing all Chinese customers.

The default cluster (in this case, the USA_customers cluster) is used by default when the generic class "Customer" is used. Example:

Class-Custer

When querying the "Customer" class, all the involved clusters are scanned:

Class-Custer

If you know the location of a customer you're looking for you can query the target cluster directly. This avoids scanning the other clusters and optimizes the query:

Class-Cluster

To add a new cluster to a class, use the ALTER CLASS command. To remove a cluster use REMOVECLUSTER in ALTER CLASS command. Example to create the cluster "USA_Customers" under the "Customer" class:

ALTER CLASS Customer ADDCLUSTER USA_Customers

The benefits of using different physical places to store records are:

faster queries against clusters because only a sub-set of all the class's clusters must be searched
good partitioning allows you to reduce/remove the use of indexes
parallel queries if on multiple disks
sharding large data sets across multiple disks or server instances

There are two types of clusters:

Physical Cluster (known as local) which is persistent because it writes directly to the file system
Memory Cluster where everything is volatile and will be lost on termination of the process or server if the database is remote

For most cases physical clusters are preferred because the database must be persistent. OrientDB creates physical clusters by default so you don't have to worry too much about it for now.

To view all clusters, from the console run the clusters command:

orientdb> clusters

CLUSTERS:
----------------------------------------------+------+---------------------+-----------+
 NAME                                         |  ID  | TYPE                | RECORDS   |
----------------------------------------------+------+---------------------+-----------+
 account                                      |    11| PHYSICAL            |      1107 |
 actor                                        |    91| PHYSICAL            |         3 |
 address                                      |    19| PHYSICAL            |       166 |
 animal                                       |    17| PHYSICAL            |         0 |
 animalrace                                   |    16| PHYSICAL            |         2 |
 ....                                         |  ....| ....                |      .... |
----------------------------------------------+------+---------------------+-----------+
 TOTAL                                                                           23481 |
---------------------------------------------------------------------------------------+

Since by default each class has its own cluster, we can query the database's users by class or by cluster:

orientdb> browse cluster OUser

---+---------+--------------------+--------------------+--------------------+--------------------
  #| RID     |name                |password            |status              |roles
---+---------+--------------------+--------------------+--------------------+--------------------
  0|     #5:0|admin               |{SHA-256}8C6976E5B5410415BDE908BD4DEE15DFB167A9C873FC4BB8A81F6F2AB448A918|ACTIVE              |[1]
  1|     #5:1|reader              |{SHA-256}3D0941964AA3EBDCB00CCEF58B1BB399F9F898465E9886D5AEC7F31090A0FB30|ACTIVE              |[1]
  2|     #5:2|writer              |{SHA-256}B93006774CBDD4B299389A03AC3D88C3A76B460D538795BC12718011A909FBA5|ACTIVE              |[1]
---+---------+--------------------+--------------------+--------------------+--------------------

The result is identical to browse class ouser executed in the classes section because there is only one cluster for the OUser class in this example.

The strategy where OrientDB selects the cluster when inserts a new record is configurable and pluggable. For more information take a look at Cluster Selection.