OrientDB Manual

Relationships

The most important feature of a graph database is the management of relationships. Many users come to OrientDB from MongoDB or other document databases because those databases lack efficient support for relationships.

Relational Model

The relational model (and RDBMS - relational database management systems) has long been thought to be the best way to handle relationships. Graph databases suggest a more modern approach to this topic.

Most database developers are familiar with the relational model, given its 30+ years of dominance spanning generations of developers. Let's review how these systems manage relationships. As an example, we will use the relationships between the Customer and Address tables.

1-to-1 relationship

RDBMSs store a reference to the target record in the "address" column of the Customer table. This is called a foreign key. The foreign key points to the primary key of the related record in the Address table:

Figure: RDBMS 1-to-1

To retrieve the address pointed to by customer "Luca", the query in an RDBMS would be:

SELECT B.location FROM Customer A, Address B WHERE A.name = 'Luca' AND A.address = B.id

This is a JOIN! A JOIN is executed at run-time every time you retrieve a relationship.
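As a minimal sketch, the query above can be run against an in-memory SQLite database from Python. The table and column names follow the query; the `id` and `name` column types and all sample rows ('Luca', 'Rome', and so on) are invented for illustration.

```python
import sqlite3

# In-memory database; table and column names follow the query above.
# The sample rows are invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Address (id INTEGER PRIMARY KEY, location TEXT);
    CREATE TABLE Customer (
        id INTEGER PRIMARY KEY,
        name TEXT,
        address INTEGER REFERENCES Address(id)  -- foreign key
    );
    INSERT INTO Address VALUES (10, 'Rome'), (11, 'London');
    INSERT INTO Customer VALUES (1, 'Luca', 10), (2, 'Jill', 11);
""")

# The join condition is evaluated at query time, on every execution.
one_to_one = conn.execute(
    "SELECT B.location FROM Customer A, Address B "
    "WHERE A.name = 'Luca' AND A.address = B.id"
).fetchall()
print(one_to_one)  # [('Rome',)]
```

Nothing in the Customer row points directly at the Address row: the database has to match `A.address` against `B.id` each time the query runs.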

1-to-Many relationship

Since RDBMSs have no concept of collections, the Customer table cannot hold multiple foreign keys. The way to manage a 1-to-Many relationship is to move the foreign key to the Address table.

Figure: RDBMS 1-to-N

To extract all addresses of customer 'Luca', the query in an RDBMS reads:

SELECT B.location FROM Customer A, Address B WHERE A.name = 'Luca' AND B.customer = A.id
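The 1-to-Many layout can be sketched the same way with SQLite from Python. The `customer` foreign key now lives in the Address table, as in the query above; the sample rows are invented, and the `ORDER BY` is added here only to make the output deterministic.

```python
import sqlite3

# The foreign key has moved to the Address table (column "customer").
# Sample rows are invented for illustration only.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Customer (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE Address (
        id INTEGER PRIMARY KEY,
        location TEXT,
        customer INTEGER REFERENCES Customer(id)
    );
    INSERT INTO Customer VALUES (1, 'Luca'), (2, 'Jill');
    INSERT INTO Address VALUES
        (10, 'Rome', 1), (11, 'London', 1), (12, 'Paris', 2);
""")

one_to_many = conn.execute(
    "SELECT B.location FROM Customer A, Address B "
    "WHERE A.name = 'Luca' AND B.customer = A.id "
    "ORDER BY B.location"
).fetchall()
print(one_to_many)  # [('London',), ('Rome',)]
```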

Many-to-Many relationship

The most complex case is the Many-to-Many relationship. To handle this type of association, RDBMSs need a separate, intermediary table that pairs Customer and Address records in all required combinations. This results in a double JOIN per record at runtime:

Figure: RDBMS Many-to-Many

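To extract all addresses of customer 'Luca', the query in an RDBMS becomes the following (assuming the intermediary CustomerAddress table holds the foreign keys in columns named customer and address):

SELECT B.location FROM Customer A, Address B, CustomerAddress C WHERE A.name = 'Luca' AND C.customer = A.id AND C.address = B.id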
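A sketch of the intermediary-table approach, again using SQLite from Python. The column names `customer` and `address` in CustomerAddress are assumptions made for this example, since the text does not spell out that table's layout; the sample rows are invented.

```python
import sqlite3

# CustomerAddress pairs customers with addresses. Its column names
# ("customer", "address") are assumed for this sketch.
conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE Customer (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE Address (id INTEGER PRIMARY KEY, location TEXT);
    CREATE TABLE CustomerAddress (
        customer INTEGER REFERENCES Customer(id),
        address INTEGER REFERENCES Address(id)
    );
    INSERT INTO Customer VALUES (1, 'Luca'), (2, 'Jill');
    INSERT INTO Address VALUES (10, 'Rome'), (11, 'London');
    INSERT INTO CustomerAddress VALUES (1, 10), (1, 11), (2, 11);
""")

# Two join conditions: Customer -> CustomerAddress -> Address.
many_to_many = conn.execute(
    "SELECT B.location "
    "FROM Customer A, Address B, CustomerAddress C "
    "WHERE A.name = 'Luca' AND C.customer = A.id AND C.address = B.id "
    "ORDER BY B.location"
).fetchall()
print(many_to_many)  # [('London',), ('Rome',)]
```

Note that 'London' is shared by both customers, which is exactly what the intermediary table makes possible.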

The problem with JOINS

With document and relational DBMSs, the more data you have, the slower the database performs. Joins have heavy runtime costs. In comparison, OrientDB handles relationships as physical links to the records, assigned only once when the edge is created, so following a link costs O(1). Compare this to an RDBMS that "computes" the relationship every single time you query the database, at a cost of O(log N) per lookup. With OrientDB, speed of traversal is not affected by the database size. It stays constant whether the database has one record or 100 billion records. This is critical in the age of Big Data.

Searching for an ID at runtime, for every record, each time you execute a query could be very expensive! The first optimization with an RDBMS is to use indexes. Indexes speed up searches, but they slow down INSERT, UPDATE and DELETE operations. In addition, they occupy substantial space on disk and in memory. You also need to ask: are you sure the lookup into an index is actually fast? Let's try to understand how indexes work.

Do indexes solve the problem with JOIN?

The database industry has plenty of indexing algorithms. The most common in both relational and NoSQL DBMSs is the B+Tree. All balanced trees work in similar ways. Here is an example of how it would work when you're looking for "Luca": after only 5 hops the record is found.

Figure: RDBMS Indexes

But what if there were millions or billions of records? There would be many, many more hops. And this operation is executed for every record on every JOIN! Imagine joining 4 tables with thousands of records: the number of index lookups could be in the millions!
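To put rough numbers on this, the sketch below computes an upper bound on the hops needed to find one key in a balanced tree. The function name `worst_case_hops` and the `fanout` parameter are hypothetical, introduced only for this illustration. A fanout of 2 models a plain binary tree; real B+Trees use a much larger fanout, so treat the figures as an illustration rather than a measurement of any DBMS.

```python
def worst_case_hops(record_count: int, fanout: int = 2) -> int:
    """Upper bound on node visits to find one key in a balanced tree.

    fanout is the number of children per node (2 = binary tree).
    """
    if record_count <= 1:
        return 1
    hops = 0
    reachable = 1  # keys distinguishable after `hops` levels
    while reachable < record_count:
        reachable *= fanout
        hops += 1
    return hops

for n in (1_000, 1_000_000, 1_000_000_000):
    print(n, worst_case_hops(n))
# 1000 10
# 1000000 20
# 1000000000 30

# A join that performs one index lookup per row multiplies the cost:
rows_to_join = 1_000
total_hops = rows_to_join * worst_case_hops(1_000_000)
print(total_hops)  # 20000
```

The per-lookup cost grows only logarithmically, but it is paid again for every row on every join, which is the cost the text is pointing at.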

Relations in OrientDB

OrientDB doesn't use JOINs. Instead it uses LINKs. A LINK is a relationship managed by storing the target RID in the source record. It's much like storing a pointer between two objects in memory. When you have Invoice -> Customer, the Invoice record holds a pointer to the Customer as an attribute. It's exactly the same. In this way, it is as if your database were held in memory, a memory of several exabytes.
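The idea can be modeled in a few lines of Python. This is a conceptual sketch only, not OrientDB's API or storage engine: the `store` dictionary, the `follow_link` helper, the field names and the RID values are all made up for illustration.

```python
# A toy record store: RID -> record. RIDs and field names are invented.
store = {
    "#10:0": {"@class": "Customer", "name": "Luca"},
    # The "customer" field is the LINK: it holds the target's RID.
    "#11:0": {"@class": "Invoice", "total": 120, "customer": "#10:0"},
}

def follow_link(rid: str, field: str) -> dict:
    """Load the record that `field` of record `rid` points to."""
    target_rid = store[rid][field]  # read the RID stored in the source
    return store[target_rid]        # go straight to the target record

customer = follow_link("#11:0", "customer")
print(customer["name"])  # Luca
```

The Invoice already knows where its Customer is, so no search by value over the Customer records is needed to resolve the relationship.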

What about 1-to-N relationships? These relationships are handled as a collection of RIDs, like you would manage objects in memory. OrientDB supports different kinds of relationships:

LINK, to point to one record only.
LINKSET, to point to several records. Like Java Sets, the same RID can only be included once. The pointers also have no order.
LINKLIST, to point to several records. Like Java Lists, they are ordered and can contain duplicates.
LINKMAP, to point to several records with a key stored in the source record. The Map values are the RIDs. Works like the Java Map<?,Record>.
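The semantics of the four kinds can be mirrored with Python's built-in collections. This is an analogy under stated assumptions, not OrientDB code: the RID values and the map keys ("home", "office") are invented.

```python
# Analogy only: Python collections standing in for OrientDB link types.
link = "#12:0"                                  # LINK: exactly one target

linkset = {"#12:0", "#12:1", "#12:0"}           # LINKSET: duplicates collapse
linklist = ["#12:0", "#12:1", "#12:0"]          # LINKLIST: ordered, duplicates kept
linkmap = {"home": "#12:0", "office": "#12:1"}  # LINKMAP: key -> RID

print(len(linkset))     # 2
print(len(linklist))    # 3
print(linklist[2])      # #12:0
print(linkmap["home"])  # #12:0
```

Adding the same RID twice leaves the set unchanged but grows the list, which is the practical difference between LINKSET and LINKLIST described above.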