What is Sharding or Data Partitioning?

Sharding (also known as data partitioning) is the process of splitting a large dataset into smaller partitions that are placed on different machines. Each partition is known as a "shard".

Each shard has the same database schema as the original database. Most data is distributed such that each row appears in exactly one shard. The combined data from all shards is the same as the data from the original database.
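
The two properties above (each row lives in exactly one shard, and the shards together reproduce the original data) can be checked with a minimal sketch. The `shard_for` rule, the row layout, and the shard count are assumptions made for illustration only; any deterministic placement rule would work the same way.

```python
from collections import Counter

def shard_for(row_id: int, num_shards: int) -> int:
    """Hypothetical placement rule: pick a shard from the row's integer id."""
    return row_id % num_shards

def split_into_shards(rows, num_shards):
    """Distribute rows so that each row lands in exactly one shard."""
    shards = [[] for _ in range(num_shards)]
    for row in rows:
        shards[shard_for(row["id"], num_shards)].append(row)
    return shards

# Illustrative data: every shard stores rows with the same fields (same schema).
rows = [{"id": i, "name": f"user{i}"} for i in range(1, 11)]
shards = split_into_shards(rows, 3)

# Property 1: each row appears in exactly one shard.
placements = Counter(row["id"] for shard in shards for row in shard)
assert all(placements[row["id"]] == 1 for row in rows)

# Property 2: the combined shards equal the original dataset.
combined = sorted((row for shard in shards for row in shard), key=lambda r: r["id"])
assert combined == rows
```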

What scalability issues are solved by Sharding?

  • Performance degradation with a single DB server architecture.

  • Queries and updates become slower, and network bandwidth starts to saturate.

  • Disk space is running out.

What are some common Sharding Schemes?

Strategy 1 - Horizontal or Range Based Sharding
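
As a rough sketch of range-based placement, each shard can own a contiguous range of key values, and a lookup finds the range a key falls into. The boundary values and the `range_shard` name are hypothetical and chosen only for illustration.

```python
import bisect

# Hypothetical split points: shard 0 holds keys below 1000, shard 1 holds
# 1000-1999, shard 2 holds 2000-2999, and shard 3 holds everything from 3000 up.
BOUNDARIES = [1000, 2000, 3000]

def range_shard(key: int) -> int:
    """Return the index of the shard whose key range contains `key`."""
    return bisect.bisect_right(BOUNDARIES, key)

for key in (42, 1000, 2999, 5000):
    print(key, "-> shard", range_shard(key))
```

Because neighbouring keys land on the same shard, range scans stay local to one machine, but a skewed key distribution can leave some shards much fuller than others.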

Strategy 2 - Vertical Sharding
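
One common reading of vertical sharding is that data is split by table or feature rather than by row, so that each table lives on its own server. The sketch below assumes that reading; the table names and server names are hypothetical.

```python
# Hypothetical assignment of whole tables to database servers.
TABLE_TO_SERVER = {
    "users": "db-users",
    "photos": "db-photos",
    "followers": "db-social",
}

def server_for_table(table: str) -> str:
    """Return the server that stores the given table."""
    try:
        return TABLE_TO_SERVER[table]
    except KeyError:
        raise ValueError(f"no server configured for table {table!r}") from None

print(server_for_table("photos"))
```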

Strategy 3 - Key or Hash based Sharding
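
A minimal sketch of key-based placement, assuming string keys and a fixed number of shards: the key is hashed and the hash is reduced modulo the shard count. The `hash_shard` name and the sample keys are illustrative. A digest from `hashlib` is used because Python's built-in `hash()` for strings can differ between processes.

```python
import hashlib

def hash_shard(key: str, num_shards: int) -> int:
    """Map a key to a shard index using a hash that is stable across processes."""
    digest = hashlib.md5(key.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an unsigned integer.
    return int.from_bytes(digest[:8], "big") % num_shards

for key in ("alice", "bob", "carol"):
    print(key, "-> shard", hash_shard(key, 4))
```

With this simple modulo rule, changing the number of shards remaps most keys, so adding a machine usually means moving a large share of the data.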

Strategy 4 - Directory based Sharding
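
A directory-based scheme can be sketched as a lookup table that records which shard holds each key, so placement is decided by the table rather than by a formula. The `ShardDirectory` class and its method names are hypothetical; a real deployment would keep this mapping in a dedicated lookup service or database.

```python
class ShardDirectory:
    """Hypothetical in-memory lookup table from key to shard name."""

    def __init__(self):
        self._lookup = {}

    def assign(self, key: str, shard: str) -> None:
        """Record (or change) the shard that holds `key`."""
        self._lookup[key] = shard

    def locate(self, key: str) -> str:
        """Return the shard holding `key`, or raise KeyError if it is unknown."""
        return self._lookup[key]

directory = ShardDirectory()
directory.assign("customer:17", "shard-a")
directory.assign("customer:42", "shard-b")

# Moving a key only requires updating its directory entry
# (after its data has been copied to the new shard).
directory.assign("customer:17", "shard-c")
print(directory.locate("customer:17"))
```

The extra lookup adds a hop to every request and makes the directory itself a component that must stay available.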

What are the common problems with Sharding?

xxx
