OrientDB Manual

Distributed Architecture

OrientDB can be distributed across different servers and used in different ways to maximize performance, scalability and robustness.

OrientDB uses the Hazelcast Open Source project to manage clustering. Many of the references on this page link to the official Hazelcast documentation, where you can find more information on these topics.

Creation of records (documents, vertices and edges)

In distributed mode the RID is assigned with cluster locality: each node creates new records in its own cluster for a class. If you have the class Customer and 3 nodes (node1, node2, node3), you'll have these clusters:

  • customer with id=#15 (this is the default one, assigned to node1)
  • customer_node2 with id=#16
  • customer_node3 with id=#17

So if you create a new Customer on node1, it will get a RID with the cluster-id of the "customer" cluster: #15. The same operation will generate a RID with cluster-id 16 on node2 and 17 on node3.

In this way RIDs never collide and each node can act as a master on insertion without any conflicts.
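
To see the cluster locality in action, here is a minimal sketch using the OrientDB Java document API; the connection URL, credentials and the "name" field are placeholders for your own setup:

```java
import com.orientechnologies.orient.core.db.document.ODatabaseDocumentTx;
import com.orientechnologies.orient.core.record.impl.ODocument;

public class RidLocalityExample {
  public static void main(String[] args) {
    // Connect to one node of the cluster (URL and credentials are placeholders)
    ODatabaseDocumentTx db =
        new ODatabaseDocumentTx("remote:localhost/mydb").open("admin", "admin");
    try {
      // The new record is stored in this node's own cluster for the
      // Customer class ("customer" on node1, "customer_node2" on node2, ...)
      ODocument customer = new ODocument("Customer");
      customer.field("name", "Jay");
      customer.save();

      // The cluster-id part of the RID shows which cluster was used
      System.out.println("RID: " + customer.getIdentity()
          + " (cluster-id " + customer.getIdentity().getClusterId() + ")");
    } finally {
      db.close();
    }
  }
}
```

Running the same program against node2 or node3 would print a RID with cluster-id 16 or 17 respectively, which is how inserts on different masters stay conflict-free.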

Distributed transactions

Starting from v1.6, OrientDB supports distributed transactions. When a transaction is committed, all the updated records are sent to all the servers, so each server is responsible for committing the transaction. If one or more nodes fail on commit, the quorum is checked. If the quorum has been respected, the failing nodes are aligned to the winning nodes; otherwise all the nodes roll back the transaction.
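
From the application's point of view a distributed transaction looks like a regular one. Below is a minimal sketch against the Java document API; the URL, credentials and field values are placeholders, and the error handling only illustrates the quorum behavior described above:

```java
import com.orientechnologies.orient.core.db.document.ODatabaseDocumentTx;
import com.orientechnologies.orient.core.record.impl.ODocument;

public class DistributedTxExample {
  public static void main(String[] args) {
    ODatabaseDocumentTx db =
        new ODatabaseDocumentTx("remote:localhost/mydb").open("admin", "admin");
    try {
      db.begin();

      ODocument customer = new ODocument("Customer");
      customer.field("name", "Luca");
      customer.save();

      // On commit the updated records are sent to all the servers:
      // if the write quorum is not respected the commit fails with an
      // exception and all the nodes roll the transaction back
      db.commit();
    } catch (RuntimeException e) {
      // Make sure the local transaction is rolled back as well
      db.rollback();
      throw e;
    } finally {
      db.close();
    }
  }
}
```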

What about visibility during a distributed transaction?

During a distributed transaction, in case of rollback, there can be a window of time in which the records appear changed before they are rolled back.

Limitations

OrientDB v2.0.x has some limitations you should be aware of when you work in distributed mode:

  • hotAlignment:true can leave the database in an inconsistent state. Please always set it to false, the default (see the sample configuration after this list)
  • creating a database on multiple nodes can cause synchronization problems when clusters are created automatically. Please create the databases before running in distributed mode
  • network splits are not well managed: if you set up 4 nodes and the network splits into 2 nodes on one side and 2 nodes on the other, each partition will believe it is the only survivor, and on rejoin the database could be inconsistent. Please always set up an odd number of nodes, so there is always a majority for the quorum
  • if an error happens during CREATE RECORD, the operation is fixed across the entire cluster, but some nodes could keep a wrong RID upper bound (the record is created, then deleted as the fix operation). In this case a new database deploy operation must be executed
  • constraints on distributed databases can cause problems, because some operations are executed in 2 steps: create + update. For example, in some circumstances edges are first created, then updated, but constraints like MANDATORY and NOTNULL on their fields would fail at the first step, making the creation of edges impossible in distributed mode
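
As a reference for the hotAlignment setting above, here is a minimal sketch of the relevant portion of the distributed configuration file (default-distributed-db-config.json). The quorum values and the cluster mapping shown are illustrative, not a recommendation for your setup:

```json
{
  "autoDeploy": true,
  "hotAlignment": false,
  "readQuorum": 1,
  "writeQuorum": 2,
  "clusters": {
    "internal": {},
    "index": {},
    "*": { "servers": ["<NEW_NODE>"] }
  }
}
```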