<head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"/><title>Differences: Sharding vs. Partition</title></head><body><article class="page serif"><header><h1 class="page-title">Differences: Sharding vs. Partition</h1><p class="page-description"></p><table class="properties"><tbody><tr class="property-row property-row-created_by"><th><span class="icon property-icon"><svg role="graphics-symbol" viewBox="0 0 16 16" style="width:14px;height:14px;display:block;fill:rgba(55, 53, 47, 0.45);flex-shrink:0" class="typesCreatedBy"><path d="M8 15.126C11.8623 15.126 15.0615 11.9336 15.0615 8.06445C15.0615 4.20215 11.8623 1.00293 7.99316 1.00293C4.13086 1.00293 0.938477 4.20215 0.938477 8.06445C0.938477 11.9336 4.1377 15.126 8 15.126ZM8 10.4229C6.05176 10.4229 4.54785 11.1133 3.83008 11.9131C2.90039 10.9082 2.33301 9.55469 2.33301 8.06445C2.33301 4.91992 4.84863 2.39746 7.99316 2.39746C11.1377 2.39746 13.6738 4.91992 13.6738 8.06445C13.6738 9.55469 13.1064 10.9082 12.1699 11.9131C11.4521 11.1133 9.94824 10.4229 8 10.4229ZM8 9.30176C9.32617 9.30859 10.3516 8.18066 10.3516 6.71094C10.3516 5.33008 9.31934 4.18164 8 4.18164C6.6875 4.18164 5.6416 5.33008 5.64844 6.71094C5.65527 8.18066 6.68066 9.28809 8 9.30176Z"></path></svg></span>Created by</th><td><span class="user"><img src="Differences%20Sharding%20vs%20Partition%2011c6cd51990d80f8803ae41be196a8c0/IMG_2295.jpg" class="icon user-icon"/>JiaLin Huang</span></td></tr><tr class="property-row property-row-last_edited_time"><th><span class="icon property-icon"><svg role="graphics-symbol" viewBox="0 0 16 16" style="width:14px;height:14px;display:block;fill:rgba(55, 53, 47, 0.45);flex-shrink:0" class="typesCreatedAt"><path d="M8 15.126C11.8623 15.126 15.0615 11.9336 15.0615 8.06445C15.0615 4.20215 11.8623 1.00293 7.99316 1.00293C4.13086 1.00293 0.938477 4.20215 0.938477 8.06445C0.938477 11.9336 4.1377 15.126 8 15.126ZM8 13.7383C4.85547 13.7383 2.33301 11.209 2.33301 8.06445C2.33301 4.91992 4.84863 2.39746 7.99316 2.39746C11.1377 2.39746 13.6738 4.91992 13.6738 8.06445C13.6738 11.209 11.1445 13.7383 8 13.7383ZM4.54102 8.91211H7.99316C8.30078 8.91211 8.54004 8.67285 8.54004 8.37207V3.8877C8.54004 3.58691 8.30078 3.34766 7.99316 3.34766C7.69238 3.34766 7.45312 3.58691 7.45312 3.8877V7.83203H4.54102C4.2334 7.83203 4.00098 8.06445 4.00098 8.37207C4.00098 8.67285 4.2334 8.91211 4.54102 8.91211Z"></path></svg></span>Last edited</th><td><time>@2024年10月11日 19:14</time></td></tr><tr class="property-row property-row-multi_select"><th><span class="icon property-icon"><svg role="graphics-symbol" viewBox="0 0 16 16" style="width:14px;height:14px;display:block;fill:rgba(55, 53, 47, 0.45);flex-shrink:0" class="typesMultipleSelect"><path d="M1.91602 4.83789C2.44238 4.83789 2.87305 4.40723 2.87305 3.87402C2.87305 3.34766 2.44238 2.91699 1.91602 2.91699C1.38281 2.91699 0.952148 3.34766 0.952148 3.87402C0.952148 4.40723 1.38281 4.83789 1.91602 4.83789ZM5.1084 4.52344H14.3984C14.7607 4.52344 15.0479 4.23633 15.0479 3.87402C15.0479 3.51172 14.7607 3.22461 14.3984 3.22461H5.1084C4.74609 3.22461 4.45898 3.51172 4.45898 3.87402C4.45898 4.23633 4.74609 4.52344 5.1084 4.52344ZM1.91602 9.03516C2.44238 9.03516 2.87305 8.60449 2.87305 8.07129C2.87305 7.54492 2.44238 7.11426 1.91602 7.11426C1.38281 7.11426 0.952148 7.54492 0.952148 8.07129C0.952148 8.60449 1.38281 9.03516 1.91602 9.03516ZM5.1084 8.7207H14.3984C14.7607 8.7207 15.0479 8.43359 15.0479 8.07129C15.0479 7.70898 14.7607 7.42188 14.3984 7.42188H5.1084C4.74609 7.42188 4.45898 7.70898 4.45898 8.07129C4.45898 8.43359 4.74609 8.7207 5.1084 8.7207ZM1.91602 13.2324C2.44238 13.2324 2.87305 12.8018 2.87305 12.2686C2.87305 11.7422 2.44238 11.3115 1.91602 11.3115C1.38281 11.3115 0.952148 11.7422 0.952148 12.2686C0.952148 12.8018 1.38281 13.2324 1.91602 13.2324ZM5.1084 12.918H14.3984C14.7607 12.918 15.0479 12.6309 15.0479 12.2686C15.0479 11.9062 14.7607 11.6191 14.3984 11.6191H5.1084C4.74609 11.6191 4.45898 11.9062 4.45898 12.2686C4.45898 12.6309 4.74609 12.918 5.1084 12.918Z"></path></svg></span>Tags</th><td></td></tr></tbody></table></header><div class="page-body"><p class="">
</p><h1 class="">TL;DR</h1><p class="">Both techniques aim to <mark class="highlight-red"><strong>reduce the size of original tables</strong></mark></p><ul class="bulleted-list"><li style="list-style-type:disc"><strong>Partitioning</strong> is a feature <mark class="highlight-red">within a single database.</mark></li></ul><ul class="bulleted-list"><li style="list-style-type:disc"><strong>Sharding</strong> distributes data <mark class="highlight-red">across different machines (databases).</mark></li></ul><p class="">
</p><p class="">
</p><h1 class="">Sharding</h1><p class="">Sharding involves distributing data across multiple databases, typically on separate machines. This requires application-level logic to determine which database to use.</p><script src="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/prism.min.js" integrity="sha512-7Z9J3l1+EYfeaPKcGXu3MS/7T+w19WtKQY/n+xzmw4hZhJ9tyYmcUS+4QqAlzhicE5LAfMQSF3iFTK9bQdTxXg==" crossorigin="anonymous" referrerPolicy="no-referrer"></script><link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/themes/prism.min.css" integrity="sha512-tN7Ec6zAFaVSG3TpNAKtk4DOHNpSwKHxxrsiw4GHKESGPs5njn/0sMCUMl2svV4wo4BK/rCP7juYz+zx+l6oeQ==" crossorigin="anonymous" referrerPolicy="no-referrer"/><pre class="code"><code class="language-JavaScript">// to get the shard based on user ID
function getShardUrl(userId) {
  const shardIndex = userId % shardUrls.length;
  return shardUrls[shardIndex];
}

// Middleware to connect to the appropriate shard
async function connectToShard(req, res, next) {
  const userId = parseInt(req.params.userId);
  const shardUrl = getShardUrl(userId);

  try {
    const client = await MongoClient.connect(shardUrl, { useNewUrlParser: true, useUnifiedTopology: true });
    req.db = client.db();
    req.dbClient = client;
    next();
  } catch (error) {
    next(error);
  }
}

// Route to get user data
app.get(&#x27;/user/:userId&#x27;, connectToShard, async (req, res) =&gt; {
  try {
    const userId = parseInt(req.params.userId);
    const user = await req.db.collection(&#x27;users&#x27;).findOne({ userId });
    res.json(user);
  } catch (error) {
    res.status(500).json({ error: error.message });
  } finally {
    await req.dbClient.close();
  }
});</code></pre><p class="">
</p><p class="">
</p><h1 class="">Horizontal Partitioning</h1><p class="">Horizontal partitioning is suitable for large volumes of data, such as bank account transactions or historical newspaper classifications. </p><p class="">It divides data <mark class="highlight-red"><strong>based on row values</strong></mark>.</p><p class="">Example in PostgreSQL:</p><script src="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/prism.min.js" integrity="sha512-7Z9J3l1+EYfeaPKcGXu3MS/7T+w19WtKQY/n+xzmw4hZhJ9tyYmcUS+4QqAlzhicE5LAfMQSF3iFTK9bQdTxXg==" crossorigin="anonymous" referrerPolicy="no-referrer"></script><link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/themes/prism.min.css" integrity="sha512-tN7Ec6zAFaVSG3TpNAKtk4DOHNpSwKHxxrsiw4GHKESGPs5njn/0sMCUMl2svV4wo4BK/rCP7juYz+zx+l6oeQ==" crossorigin="anonymous" referrerPolicy="no-referrer"/><pre class="code"><code class="language-SQL">-- table
CREATE TABLE sales (
    id SERIAL,
    sale_date DATE NOT NULL,
    amount DECIMAL(10,2)
) PARTITION BY RANGE (sale_date);

-- partition
CREATE TABLE sales_2023 PARTITION OF sales
    FOR VALUES FROM (&#x27;2023-01-01&#x27;) TO (&#x27;2024-01-01&#x27;);

-- partition
CREATE TABLE sales_2024 PARTITION OF sales
    FOR VALUES FROM (&#x27;2024-01-01&#x27;) TO (&#x27;2025-01-01&#x27;);

-- Data insertion 
-- PostgreSQL automatically routes to the correct partition
INSERT INTO sales (sale_date, amount) VALUES (&#x27;2023-05-01&#x27;, 100.00);
INSERT INTO sales (sale_date, amount) VALUES (&#x27;2024-02-15&#x27;, 200.00);</code></pre><p class="">
</p><p class="">
</p><h1 class="">Vertical Partitioning</h1><p class="">Vertical partitioning is used when an original data structure (e.g., a User type) becomes too <mark class="highlight-red"><strong>BAGGY</strong></mark>. It splits the structure into frequently used (basic) and less frequently used (extended) parts.</p><p class="">Example:</p><script src="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/prism.min.js" integrity="sha512-7Z9J3l1+EYfeaPKcGXu3MS/7T+w19WtKQY/n+xzmw4hZhJ9tyYmcUS+4QqAlzhicE5LAfMQSF3iFTK9bQdTxXg==" crossorigin="anonymous" referrerPolicy="no-referrer"></script><link rel="stylesheet" href="https://cdnjs.cloudflare.com/ajax/libs/prism/1.29.0/themes/prism.min.css" integrity="sha512-tN7Ec6zAFaVSG3TpNAKtk4DOHNpSwKHxxrsiw4GHKESGPs5njn/0sMCUMl2svV4wo4BK/rCP7juYz+zx+l6oeQ==" crossorigin="anonymous" referrerPolicy="no-referrer"/><pre class="code"><code class="language-SQL">-- origin table user
-- CREATE TABLE users (
--     user_id SERIAL PRIMARY KEY,
--     username VARCHAR(50),
--     email VARCHAR(100),
--     registration_date DATE,
--     profile_picture BYTEA,
--     bio TEXT
-- );

-- vertical partition
CREATE TABLE users_basic (
    user_id SERIAL PRIMARY KEY,
    username VARCHAR(50),
    email VARCHAR(100),
    registration_date DATE
);

CREATE TABLE users_extended (
    user_id INT PRIMARY KEY,
    profile_picture BYTEA,
    bio TEXT,
    FOREIGN KEY (user_id) REFERENCES users_basic(user_id)
);

-- Data migration
INSERT INTO users_basic (user_id, username, email, registration_date)
SELECT user_id, username, email, registration_date FROM users;

INSERT INTO users_extended (user_id, profile_picture, bio)
SELECT user_id, profile_picture, bio FROM users;</code></pre><p class="">
</p><p class=""><strong>Note</strong>: In the original table, <code>user_id</code> is <code>SERIAL</code> (PostgreSQL-specific), which automatically generates a unique ID. In the <code>users_extended</code> table, <code>user_id</code> is declared as INT and references the <code>user_id</code> in the <code>users_basic</code> table via a FOREIGN KEY, linking the two tables.</p><p class="">
</p></div></article><span class="sans" style="font-size:14px;padding-top:2em"></span></body>