<aside> đź’¬ This document is open for commenting.

</aside>

<aside> đź“– This document contains a lot of code snippets which are mandatory for reading, since the code itself and the comments inside it contain practical information, that, when applied to theory, makes understanding of union-db a lot easier.

</aside>

Table of contents

Intro

Queries are the way for clients to efficiently read information from the database. They are called like this, because in order implement them, one doesn’t need any special tools other than regular query canister methods and the database itself. In other words, union-db's Queries are just plain old canister queries, written with a little bit of extra knowledge in mind. By extra knowledge we mean two things:

To demonstrate how Queries work, let’s start with a simple example. Let’s imagine that we’re building a social app, where you can register, choose a username and chat to other users 1-on-1. Here is our database schema (just to keep it in front of our eyes - the database itself is schemaless, as you might remember):

struct Database {
  users: BTreeMap<User>,
	dialogues: BTreeMap<Dialogue>,
}

struct User {
  username: String,
  dialogue_ids: BTreeMap<Nat>,
}

struct Dialogue {
  messages: BTreeMap<Message>,
}

struct Message {
  timestamp: u64,
  sender: Principal,
  content: String,
}

let mut state = State::new();

Here we have two main collections: users and dialogues. Users are stored by their Principal interpreted as Nat. Dialogues are stored by some randomly generated Nat ids. Each two distinct users can only have one single dialogue between them. This means, that users and dialogues entities have a many-to-many (2-to-many) relationship between them. Each user has a username and a list of dialogue ids, in which they’re currently participating. Each such dialogue id in user’s list is stored by the Principal of another user. This means, that by knowing principals of any two users, we can retrieve their dialogue id. These ids are used to retrieve information about a dialogue - messages sent by each of two users. Messages consist of a timestamp, a sender’s principal and some textual content. We assume, that Messages are stored as Leafs - their fields can’t be split between shards and they are always complete. Messages and Dialogue entities have a one-to-many relationship between them (each dialog stores a lot of messages, but each message is stored only in one dialogue). This means, that we should store a list of messages in a dialogue as a subdocument of this dialogue. This is exactly what we do. We store messages in a Fork document inside a dialogue, each message is stored by some randomly generated Nat id.

Since this document is dedicated to Queries, we’re not going to describe how the information gets into this database. We just assume that it is already there, ready to be delivered. Learn more on how to write data to a union-db database in Transactions section.

Simple example

For starters, let’s begin with something simple. Let’s implement a query that returns a username of the user by it’s principal, so our users will be able to see usernames of each other.

#[query]
fn get_username(id: Principal) -> Result<Option<String>, Principal> {
	// constucts a key that points to "users : <principal> : username" document
	let key = CompositeKey::from_segments([str_key("users"), prin_key(id), str_key("username")]);

	// checks if this key is stored on this shard
	// if not, returns a principal of a neighboring shard to redirect to 
	state.check_bounds(&key)?;

	// if we've reached this line, this means that we're on a right shard
	// attempts to retrieve a username document (which is Leaf)
	// returns Option<Document>, where None means, that there is no such document exist in the database (the user is not registered)
	let username_blob_opt: Option<Document> = state.get(&key);

	if username_blob_opt.is_none() {
		return Ok(None);
	}

	// attempts to unwrap the document into a Leaf
	// then decodes the byte content of the leaf into String
	// decoding algorithm is not a subject of this documentation - use whatever you want
	let username: String = decode(username_blob_opt.unwrap().unwrap_leaf());

	Ok(Some(username))
}

This function accepts a user principal as an input and returns back a Result type, where Ok variant means that the query was executed against correct shard, that indeed stores (or is expected to store) this key, and where Err variant means that the client has to execute the same query function with the same arguments, but on another shard, that is supplied with the variant.

Yes, in union-db, clients are responsible for routing their Queries to correct shards (this is only valid for Queries - Transactions are handled differently). Shards will provide the necessary information to where should a client redirect their query, but the transportation of the payload between shards is expected from clients, and not shards. In other words, since our state is distributed between canisters, we need inter-canister calls to read it; but because we can’t do inter-canister query calls currently - we delegate them to clients, so they “make” these calls for us. This is both: very cheap and super fast.

The client’s algorithm is pretty simple: