NoSQL vs SQL — Document Model, Collections, and When to Use MongoDB

📚 MEAN Stack 📂 Chapter 8: MongoDB Fundamentals 📄 Lesson 5010 Beginner 🕒 August 1, 2025

MongoDB is a document database — it stores data as flexible, JSON-like documents rather than rows in rigid tables. Before diving into MongoDB syntax, it is essential to understand why the document model exists, how it differs from relational databases, and when MongoDB is the right tool for the job. Making this choice correctly from the start prevents painful migrations later. In a MEAN Stack application, MongoDB is the persistence layer that stores every user, task, session, and log your application generates — understanding its foundations thoroughly is non-negotiable.

Relational (SQL) vs Document (NoSQL) Databases

Concept	SQL (PostgreSQL, MySQL)	MongoDB
Data unit	Row in a table	Document in a collection
Data format	Fixed columns, typed rows	Flexible JSON-like BSON document
Schema	Defined upfront — DDL required to change	Flexible — each document can have different fields
Relationships	Foreign keys, JOINs across tables	Embedded sub-documents or references (`$lookup`)
Scaling	Vertical (bigger server) or complex sharding	Horizontal sharding built in
Transactions	Full ACID across any tables	ACID within a single document; multi-document since v4
Query language	SQL — `SELECT * FROM tasks WHERE status = 'done'`	MQL — `db.tasks.find({ status: 'done' })`
Best for	Complex joins, strict consistency, financial data	Flexible schemas, hierarchical data, rapid iteration

MongoDB Terminology vs SQL Terminology

SQL Term	MongoDB Term	Description
Database	Database	Container for collections
Table	Collection	Group of related documents
Row	Document	A single JSON-like record
Column	Field	A key-value pair in a document
Primary key	`_id`	Unique identifier, auto-generated as ObjectId
Foreign key	Reference (ObjectId)	A field storing another document’s `_id`
JOIN	`$lookup`	Aggregation pipeline stage
Index	Index	Data structure for fast queries
View	View / Aggregation	Read-only virtual collection
Stored procedure	JavaScript functions (legacy)	Application-level logic is preferred

When to Choose MongoDB

MongoDB Excels At	SQL Is a Better Choice
Storing hierarchical data naturally (user with embedded addresses)	Many-to-many relationships requiring complex JOINs
Rapidly evolving schemas during product development	Financial transactions requiring strict ACID across tables
High write throughput — IoT, event logs, real-time analytics	Reporting and business intelligence with complex aggregations
Horizontal scaling across many servers	Existing team with deep SQL expertise
Documents of varying shape in the same collection	Regulatory requirements mandating relational integrity
Catalogues, product data, content management	Payroll, banking, ERP systems

Note: MongoDB documents are stored internally as BSON (Binary JSON) — a binary-encoded serialisation of JSON-like documents. BSON extends JSON with additional types: Date, ObjectId, Decimal128, Binary, and Timestamp. When you read a document via Mongoose or the MongoDB driver, BSON is automatically deserialised to JavaScript objects, so you work with plain JS objects in your code. BSON is only relevant when you inspect raw database storage.

Tip: MongoDB’s flexible schema is a feature, not a bug — but it requires discipline. Without Mongoose schemas, a typo in field names silently creates new fields instead of raising an error. Always define schemas with Mongoose in a MEAN Stack application. Mongoose adds type coercion, validation, default values, and virtual properties on top of MongoDB’s raw storage, giving you the flexibility of MongoDB with the safety of a defined contract.

Warning: The ObjectId type is MongoDB’s default primary key format — a 12-byte value that encodes a timestamp, machine identifier, and counter. It appears as a 24-character hex string like 64a1f2b3c8e4d5f6a7b8c9d0. ObjectId fields are not plain strings — they must be converted with new ObjectId('...') when querying by _id. Mongoose handles this automatically, but when using the raw MongoDB driver you must convert string IDs before querying.

MongoDB Document Structure

// A MongoDB document — stored as BSON, accessed as JavaScript object
{
    "_id": ObjectId("64a1f2b3c8e4d5f6a7b8c9d0"),  // auto-generated unique ID
    "title":       "Complete project proposal",
    "description": "Write the Q4 project proposal for the client",
    "status":      "in-progress",
    "priority":    "high",
    "completed":   false,
    "dueDate":     ISODate("2025-11-30T00:00:00.000Z"),

    // Embedded sub-document — related data stored together
    "assignee": {
        "userId": ObjectId("64a1f2b3c8e4d5f6a7b8c9d1"),
        "name":   "Alice Chen",
        "email":  "alice@example.com"
    },

    // Array of strings
    "tags": ["proposal", "Q4", "client"],

    // Array of embedded documents
    "attachments": [
        {
            "filename": "proposal-draft-v1.pdf",
            "size":     204800,
            "uploadedAt": ISODate("2025-11-01T10:00:00.000Z")
        }
    ],

    // Reference to another collection document (like a foreign key)
    "projectId": ObjectId("64a1f2b3c8e4d5f6a7b8c9d2"),

    // Auto-managed timestamps (with Mongoose { timestamps: true })
    "createdAt": ISODate("2025-10-15T09:30:00.000Z"),
    "updatedAt": ISODate("2025-11-01T14:22:00.000Z")
}

// Same user collection — different documents can have different fields
// (flexible schema — not possible in SQL without ALTER TABLE)
{ "_id": ObjectId("..."), "name": "Bob", "email": "bob@example.com", "role": "admin" }
{ "_id": ObjectId("..."), "name": "Carol", "email": "carol@example.com", "oauthProvider": "google" }
{ "_id": ObjectId("..."), "name": "Dave", "email": "dave@example.com", "phone": "+1-555-0100" }

Database, Collection, and Document Hierarchy

MongoDB Server (mongod process)
 |
 +-- Database: taskmanager
 |    |
 |    +-- Collection: users
 |    |    +-- Document: { _id, name, email, password, role, createdAt }
 |    |    +-- Document: { _id, name, email, password, role, avatar, createdAt }
 |    |    +-- Document: { _id, name, email, password, role, createdAt }
 |    |
 |    +-- Collection: tasks
 |    |    +-- Document: { _id, title, status, priority, user, createdAt }
 |    |    +-- Document: { _id, title, status, priority, user, dueDate, tags, createdAt }
 |    |    +-- Document: { _id, title, status, priority, user, attachments, createdAt }
 |    |
 |    +-- Collection: sessions     (optional — for refresh tokens)
 |         +-- Document: { _id, userId, token, expiresAt, createdAt }
 |
 +-- Database: taskmanager_test    (separate test database)
      +-- Collection: users
      +-- Collection: tasks

How It Works

Step 1 — Documents Map Naturally to JavaScript Objects

A MongoDB document is a JSON-like object — fields are key-value pairs, values can be strings, numbers, booleans, dates, arrays, or nested objects. When your Node.js application reads a document, it receives a plain JavaScript object. No ORM mapping from table rows to objects is needed. A task document in MongoDB looks exactly like the task object your Angular component works with.

A collection is roughly analogous to a SQL table, but without a schema constraint. All user documents live in the users collection; all task documents live in tasks. By convention, collection names are lowercase plural nouns. MongoDB creates collections automatically the first time you insert a document — no CREATE TABLE statement is required.

Step 3 — The _id Field Is Always Unique

Every MongoDB document has an _id field. If you do not supply one when inserting, MongoDB generates an ObjectId automatically. An ObjectId is not random — its first 4 bytes encode a Unix timestamp, meaning ObjectIds are roughly sortable by creation time. You can also use a string or integer as _id if you need human-readable IDs — just ensure uniqueness yourself.

Step 4 — Embedded Documents vs References

MongoDB offers two ways to model relationships. Embedding stores related data inside the parent document (a user’s address fields inside the user document). It is fast for reads because everything is in one document, but makes it harder to update the sub-data independently. Referencing stores an ObjectId pointing to another document (a task’s userId field pointing to a user document) — analogous to a foreign key — and requires a separate query or $lookup to join them.

Step 5 — MongoDB Atlas Is the Production-Ready Cloud Option

MongoDB Atlas is the fully managed cloud database service. The free M0 tier provides 512MB of storage — more than enough for development and small production workloads. Atlas handles backups, monitoring, scaling, and security patches. In a MEAN Stack production deployment, your Express API connects to Atlas via a connection string rather than a locally installed MongoDB server. The connection string format is: mongodb+srv://username:password@cluster.mongodb.net/dbname.

Real-World Example: Data Modelling Decision

// ── Option A: Embedded — store user info inside each task document ─────────
// Pros: single query gets task + assignee info
// Cons: if user changes name/email, must update every task they own
{
    "_id": ObjectId("..."),
    "title": "Review PR",
    "user": {
        "_id":   ObjectId("64a1f..."),
        "name":  "Alice",
        "email": "alice@example.com"
    }
}

// ── Option B: Referenced — store only the user's _id in the task ──────────
// Pros: user data stays in sync (update once in users collection)
// Cons: requires populate() or $lookup to get user info with task
{
    "_id":   ObjectId("..."),
    "title": "Review PR",
    "user":  ObjectId("64a1f...")   // reference to users collection
}

// Mongoose populate() resolves the reference in a second query:
const task = await Task.findById(id).populate('user', 'name email');
// Result:
// { _id: ..., title: 'Review PR', user: { _id: ..., name: 'Alice', email: '...' } }

// ── Decision guide for MEAN Stack Task Manager ────────────────────────────
// Reference (separate documents):
//   user inside task           → userId: ObjectId  (user can be updated independently)
//   project inside task        → projectId: ObjectId

// Embed (sub-document inside parent):
//   task attachments           → attachments: [{ filename, size, url }]
//   task history/activity log  → activity: [{ action, timestamp, by }]
//   user notification prefs    → notifications: { email: true, push: false }

Common Mistakes

Mistake 1 — Choosing MongoDB when relational data is required

❌ Wrong assumption — MongoDB for a complex financial system:

Orders, order_items, products, inventory, payments, refunds — all with cross-collection
consistency guarantees. Multi-document transactions are possible in MongoDB but complex.
SQL handles this natively with ACID transactions and foreign key constraints.

✅ Correct — match the tool to the data model:

Task Manager, CMS, Product Catalogue, User Profiles, Event Logs → MongoDB (document-shaped)
E-commerce inventory, Banking, Payroll, ERP → PostgreSQL (relational integrity critical)

Mistake 2 — Storing ObjectId as a string in application code

❌ Wrong — comparing string to ObjectId always fails:

const taskUserId = task.user;           // ObjectId("64a1f...")
const currentUser = req.user.id;        // '64a1f...' (string from JWT)
if (taskUserId === currentUser) { }     // false! ObjectId !== string

✅ Correct — convert to string for comparison:

if (task.user.toString() === req.user.id) { }  // true
// OR let Mongoose handle it:
const task = await Task.findOne({ _id: id, user: req.user.id }); // Mongoose coerces string to ObjectId

Mistake 3 — Deeply nesting documents more than 2 levels

❌ Wrong — 4-level nesting becomes unmaintainable:

// Avoid:
{
    project: {
        team: {
            members: [
                { user: { tasks: [{ comments: [...] }] } }
            ]
        }
    }
}

✅ Correct — keep embedding shallow, use references for deep relationships:

// Separate collections with references:
// projects: { _id, name, teamId }
// teams:    { _id, projectId, memberIds: [ObjectId] }
// tasks:    { _id, title, projectId, userId }
// comments: { _id, taskId, userId, text }

Quick Reference

Term	Meaning	SQL Equivalent
Database	Named container for collections	Database / Schema
Collection	Group of documents (no fixed schema)	Table
Document	JSON-like record with `_id`	Row
Field	Key-value pair inside a document	Column
ObjectId	12-byte auto-generated unique ID	Auto-increment primary key
Embedded document	Nested object inside a document	Denormalised column group
Reference	ObjectId pointing to another collection	Foreign key
`$lookup`	Join two collections in aggregation	JOIN
Index	B-tree structure for fast field queries	Index
Atlas	Managed cloud MongoDB service	Amazon RDS / Azure SQL

Relational (SQL) vs Document (NoSQL) Databases #

MongoDB Terminology vs SQL Terminology #

When to Choose MongoDB #

MongoDB Document Structure #

Database, Collection, and Document Hierarchy #

How It Works #

Step 1 — Documents Map Naturally to JavaScript Objects #

Step 2 — Collections Group Related Documents #

Step 3 — The _id Field Is Always Unique #

Step 4 — Embedded Documents vs References #

Step 5 — MongoDB Atlas Is the Production-Ready Cloud Option #

Real-World Example: Data Modelling Decision #

Common Mistakes #

Mistake 1 — Choosing MongoDB when relational data is required #

Mistake 2 — Storing ObjectId as a string in application code #

Mistake 3 — Deeply nesting documents more than 2 levels #

Quick Reference #

🧠 Test Yourself #

📚 More in this Tutorial Series

Relational (SQL) vs Document (NoSQL) Databases

MongoDB Terminology vs SQL Terminology

When to Choose MongoDB

MongoDB Document Structure

Database, Collection, and Document Hierarchy

How It Works

Step 1 — Documents Map Naturally to JavaScript Objects

Step 2 — Collections Group Related Documents

Step 3 — The _id Field Is Always Unique

Step 4 — Embedded Documents vs References

Step 5 — MongoDB Atlas Is the Production-Ready Cloud Option

Real-World Example: Data Modelling Decision

Common Mistakes

Mistake 1 — Choosing MongoDB when relational data is required

Mistake 2 — Storing ObjectId as a string in application code

Mistake 3 — Deeply nesting documents more than 2 levels

Quick Reference

🧠 Test Yourself