Apache Cassandra with C#

Apache Cassandra is an open-source NoSQL database that distributes large amounts of data across multiple servers. It is decentralized and scalable: every node can serve requests, so the cluster has no single point of failure and keeps working even if some of its nodes fail.

Cassandra vs MongoDB

MongoDB is a document store that organizes data into collections of documents, whereas Cassandra is a wide-column store that organizes data into tables partitioned across nodes.

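MongoDB replica sets use a primary-secondary (formerly called master-slave) architecture in which one node accepts writes, while Cassandra has a peer-to-peer architecture in which every node can accept reads and writes and the nodes communicate with each other directly.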

| | Cassandra | MongoDB |
|---|---|---|
| Pros | Highly available and scalable | Flexible and easy to use |
| | Good for handling large amounts of data and traffic | Good for storing and querying unstructured data |
| | Open source | Open source |
| Cons | Not as flexible as MongoDB | Not as highly available or scalable as Cassandra |
| | Not as good for storing unstructured data | Not as good for handling large amounts of data and traffic |

Here are some common concepts in Cassandra:

Keyspace: A keyspace is a top-level namespace that defines data replication and other settings. It is similar to a schema in the relational database world. Typically, a cluster has one keyspace per application.
Table (Column Family): In Cassandra, data is organized into tables, historically called column families. Each table consists of rows identified by a primary key, and a row does not have to store a value for every column the table defines. Tables are similar to tables in a relational database.
Column: A column is the basic unit of data storage in Cassandra. Each column has a name, value, and a timestamp. Columns are grouped together in rows or records.
Primary Key: A primary key is a unique identifier for a row in a Cassandra table. It consists of one or more columns that uniquely identify each row.
Partition Key: The partition key is part of the primary key and determines the partition in which the data is stored. Data in Cassandra is distributed across multiple nodes based on the partition key, allowing for scalable and distributed data storage.
Clustering Columns: Clustering columns are additional columns used to define the sorting order within a partition. They determine how data is ordered within a partition.
Replication: Cassandra achieves high availability and fault tolerance through replication, where data is automatically copied to multiple nodes. The replication factor sets how many copies of the data are stored, and the consistency level controls how many of those copies must respond to a read or write operation.
Consistency Level: Consistency level determines how many replicas need to respond to a read or write operation before it is considered successful. It defines the trade-off between consistency and availability in a distributed system.
Data Model: Cassandra follows a denormalized, query-driven data model, where data is duplicated across multiple tables so that each query can be served from a single table. Secondary indexes are available, but designing tables around the partition key is usually the more efficient way to query.
Tombstones: Tombstones are special markers used in Cassandra to represent deleted data. They are necessary to ensure eventual consistency in a distributed system and to propagate deletions across all replicas.
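To make the partition key, clustering column, replication, and consistency level concepts above more concrete, here is a minimal Python sketch. It is a toy model, not how Cassandra is implemented: the node names, the `ToyTable` class, and the helper functions are all hypothetical, SHA-256 stands in for Cassandra's default Murmur3 partitioner, and replica placement is reduced to simple modulo arithmetic instead of a token ring.

```python
import hashlib
from collections import defaultdict

# Hypothetical four-node cluster; the names are placeholders.
NODES = ["node-a", "node-b", "node-c", "node-d"]
REPLICATION_FACTOR = 3


def token(partition_key):
    """Hash a partition key to an integer token.

    SHA-256 is an assumption made for this sketch; Cassandra's default
    partitioner is Murmur3.
    """
    digest = hashlib.sha256(partition_key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


def replicas_for(partition_key):
    """Pick REPLICATION_FACTOR consecutive nodes, starting at the token's owner."""
    first = token(partition_key) % len(NODES)
    return [NODES[(first + i) % len(NODES)] for i in range(REPLICATION_FACTOR)]


def quorum(replication_factor):
    """QUORUM is a majority of the replicas."""
    return replication_factor // 2 + 1


def reads_see_latest_write(read_replicas, write_replicas, replication_factor):
    """True when every read set must overlap every write set in one replica."""
    return read_replicas + write_replicas > replication_factor


class ToyTable:
    """In-memory table whose primary key is (partition key, clustering key)."""

    def __init__(self):
        self._partitions = defaultdict(dict)

    def insert(self, partition_key, clustering_key, value):
        # Writing the same primary key twice overwrites the row (an upsert).
        self._partitions[partition_key][clustering_key] = value

    def read_partition(self, partition_key):
        # Rows come back ordered by the clustering key within the partition.
        rows = self._partitions.get(partition_key, {})
        return [(key, rows[key]) for key in sorted(rows)]


readings = ToyTable()
readings.insert("sensor-1", "2024-01-02", 21.5)
readings.insert("sensor-1", "2024-01-01", 20.0)
print(replicas_for("sensor-1"))            # three distinct nodes
print(readings.read_partition("sensor-1"))  # sorted by date
print(quorum(REPLICATION_FACTOR))
```

With a replication factor of 3, reading and writing at quorum (2 replicas each) satisfies the overlap check, while reading and writing at a single replica does not. That is the consistency versus availability trade-off described above.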

How to run Cassandra in Docker

Run the following command from the folder that contains the docker-compose.yml file:
- docker-compose up -d
Alternatively, start a single container with the Docker CLI:
- docker run -d --name cassandra -p 9042:9042 cassandra
Open a cqlsh session inside the running container using the following command:
- docker exec -it cassandra cqlsh

Types of NoSQL Databases

Document:

Store data in documents similar to JSON (JavaScript Object Notation) objects. Each document contains pairs of fields and values.

Graph:

Store data in nodes and edges. Nodes store information about people, places, and things; edges store the relationships between the nodes. Commonly used when relationships must be traversed to find patterns, as in social networks, fraud detection, and recommendation engines.

Example: Neo4j
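As a rough illustration of traversing relationships, the following Python sketch stores a tiny social graph as nodes and edges and finds everyone within a given number of hops of one person. The names, the edge list, and both functions are made up for this example; a real graph database would expose this through its own query language.

```python
from collections import deque

# Hypothetical "knows" relationships; each pair is one undirected edge.
EDGES = [
    ("alice", "bob"),
    ("bob", "carol"),
    ("carol", "dave"),
    ("alice", "erin"),
]


def build_adjacency(edges):
    """Turn an edge list into {node: set of neighboring nodes}."""
    adjacency = {}
    for a, b in edges:
        adjacency.setdefault(a, set()).add(b)
        adjacency.setdefault(b, set()).add(a)
    return adjacency


def within_hops(adjacency, start, max_hops):
    """Breadth-first search: {node: hops} for nodes at most max_hops from start."""
    distances = {start: 0}
    queue = deque([start])
    while queue:
        node = queue.popleft()
        if distances[node] == max_hops:
            continue  # do not expand past the hop limit
        for neighbor in adjacency.get(node, ()):
            if neighbor not in distances:
                distances[neighbor] = distances[node] + 1
                queue.append(neighbor)
    del distances[start]  # the start node is not its own contact
    return distances


graph = build_adjacency(EDGES)
print(within_hops(graph, "alice", 2))
```

Here bob and erin are one hop from alice and carol is two hops away, while dave (three hops) falls outside the limit.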

Key-Value:

Store items as keys and values. A value can typically only be retrieved by referencing its key. Common use cases include storing user preferences or caching.

Wide-Column:

Store data in tables, rows, and dynamic columns. Wide-column stores provide a lot of flexibility over relational databases because each row is not required to have the same columns. Commonly used for storing Internet of Things (IoT) data and user profile data.
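The idea that each row is not required to have the same columns can be sketched in Python with nested dictionaries. This is only a conceptual model under assumed data: the row keys, column names, and helper functions are hypothetical and do not correspond to any Cassandra API.

```python
# Each row key maps to only the columns that row actually stores.
profiles = {
    "user-1": {"name": "Ada", "email": "ada@example.com"},
    "user-2": {"name": "Lin", "last_login": "2024-01-01", "theme": "dark"},
}


def column_names(rows):
    """Every column that appears in at least one row, in sorted order."""
    return sorted({column for row in rows.values() for column in row})


def get_column(rows, row_key, column, default=None):
    """Read one cell; a column the row never stored yields the default."""
    return rows.get(row_key, {}).get(column, default)


print(column_names(profiles))
print(get_column(profiles, "user-1", "theme"))  # user-1 never stored a theme
```

Absent columns take up no space in a row, which is what makes this layout a good fit for sparse data such as IoT readings or user profiles.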
