JSON in SQL: Store, Query, and Index

Why JSON in a relational database?

Simple explanation

Think of a restaurant that serves both set menus and custom orders.

Your regular tables are like set menus — every dish (column) is predefined. But sometimes customers want something custom — extra toppings, special requests, dietary notes. Instead of adding a new column for every possible request, you store the custom details as a flexible note (JSON) alongside the set menu items.

JSON in SQL gives you the best of both worlds: strict relational structure for your core data, and flexible document storage for data that varies between rows — product attributes, user preferences, API responses, or configuration settings.

The JSON data type (SQL Server 2025)

SQL Server 2025 adds a native JSON data type that stores JSON in an optimised binary format — an alternative to storing JSON as NVARCHAR, with better performance and automatic validation.

CREATE TABLE Products (
    ProductId INT NOT NULL PRIMARY KEY,
    Name NVARCHAR(200) NOT NULL,
    Price DECIMAL(10,2) NOT NULL,
    Attributes JSON NULL    -- Flexible properties that vary by product
);

INSERT INTO Products (ProductId, Name, Price, Attributes)
VALUES (1, 'Wireless Mouse', 29.99,
    '{"color": "black", "dpi": 1600, "wireless": true, "battery": "AA"}');

Reading JSON: JSON_VALUE and OPENJSON

JSON_VALUE — extract a single scalar value

-- Get the color of product 1
SELECT Name, JSON_VALUE(Attributes, '$.color') AS Color
FROM Products
WHERE ProductId = 1;
-- Returns: Wireless Mouse, black

The $ represents the root. Navigate nested objects with dots: $.address.city.

OPENJSON — shred JSON into rows and columns

-- Expand a JSON array into rows
DECLARE @json NVARCHAR(MAX) = '[
    {"name": "Alice", "role": "Developer"},
    {"name": "Bob", "role": "DBA"},
    {"name": "Carol", "role": "Architect"}
]';

SELECT name, role
FROM OPENJSON(@json)
WITH (
    name NVARCHAR(100) '$.name',
    role NVARCHAR(100) '$.role'
);

OPENJSON is essential for converting JSON arrays (from API responses, for example) into relational rows you can join, filter, and aggregate.

Building JSON: JSON_OBJECT, JSON_ARRAY, JSON_ARRAYAGG

These functions construct JSON from relational data — the reverse of OPENJSON.

-- JSON_OBJECT: build a JSON object from key-value pairs
SELECT JSON_OBJECT('id': ProductId, 'name': Name, 'price': Price) AS ProductJson
FROM Products;
-- Returns: {"id":1,"name":"Wireless Mouse","price":29.99}

-- JSON_ARRAY: build a JSON array from values
SELECT JSON_ARRAY(1, 'hello', 3.14, NULL ABSENT ON NULL) AS ArrayResult;
-- Returns: [1,"hello",3.14]

-- JSON_ARRAYAGG: aggregate rows into a JSON array
SELECT JSON_ARRAYAGG(JSON_OBJECT('name': Name, 'price': Price)) AS AllProducts
FROM Products;
-- Returns: [{"name":"Wireless Mouse","price":29.99}, ...]

Exam tip: JSON_ARRAYAGG is an aggregate function

JSON_ARRAYAGG works like STRING_AGG or SUM — it collapses multiple rows into a single JSON array value. You can use it with GROUP BY to create nested JSON structures from relational data. This is commonly needed when preparing data for AI model input (Domain 3 — converting relational data to JSON for language model processing).

Searching JSON: JSON_CONTAINS

JSON_CONTAINS checks whether a JSON document contains a specified value or fragment:

-- Find all products where Attributes contains "wireless": true
SELECT Name, Price
FROM Products
WHERE JSON_CONTAINS(Attributes, 'true', '$.wireless') = 1;

Indexing JSON columns

Indexing NVARCHAR JSON (computed column approach)

For JSON stored as NVARCHAR, create computed columns that extract specific JSON values, then index those computed columns:

-- Add a computed column for the color property
ALTER TABLE Products
ADD Color AS JSON_VALUE(Attributes, '$.color');

-- Index it
CREATE NONCLUSTERED INDEX IX_Products_Color
ON Products (Color);

-- Now this query uses the index
SELECT Name, Price FROM Products WHERE Color = 'black';

Indexing native JSON type (SQL Server 2025)

For the native JSON data type, you can create a JSON index directly on a JSON path expression — no computed column needed:

-- Create a JSON index on a path expression
CREATE JSON INDEX IX_Products_Attributes_Color
ON Products (Attributes, '$.color');

This is simpler than the computed column approach and works only with the native JSON type.

Native JSON type vs NVARCHAR — prefer native when available
Approach	JSON in NVARCHAR	Native JSON Type (2025)
Storage format	Text (UTF-16)	Optimised binary
Storage size	Larger (text overhead)	Smaller (compressed binary)
Validation	ISJSON() check constraint	Automatic — rejects invalid JSON
Performance	Parse on every read	Pre-parsed, faster queries
Indexing	Computed columns + indexes	JSON indexes on path expressions
Compatibility	All SQL Server versions	SQL Server 2025+ and Azure SQL

Scenario: Leo's product catalog at SearchWave

Leo Torres at SearchWave is building a product catalog where every product category has different attributes. Electronics have “battery life” and “weight.” Clothing has “size” and “material.” Food has “allergens” and “expiry.”

Instead of creating columns for every possible attribute (most would be NULL), Leo stores the variable attributes as JSON:

CREATE TABLE Catalog (
    ProductId INT NOT NULL PRIMARY KEY,
    CategoryId INT NOT NULL,
    Name NVARCHAR(200) NOT NULL,
    BasePrice DECIMAL(10,2) NOT NULL,
    Specs JSON NULL  -- Category-specific attributes
);

He creates computed columns and indexes for the most-queried paths (like $.brand and $.rating), and uses OPENJSON for ad-hoc exploration of less common attributes.

This pattern — relational for core data, JSON for variable attributes — is a common exam scenario.

Question

What is the difference between JSON_VALUE and OPENJSON?

Click or press Enter to reveal answer

Answer

JSON_VALUE extracts a single scalar value from a JSON document (returns one value). OPENJSON shreds a JSON array or object into rows and columns (returns a table). Use JSON_VALUE for single lookups, OPENJSON for converting JSON arrays into relational rows.

Click to flip back

Question

How do you index a JSON property for fast lookups?

Click or press Enter to reveal answer

Answer

Create a computed column that extracts the JSON value using JSON_VALUE, then create a nonclustered index on that computed column. In SQL Server 2025 with the native JSON type, you can also create JSON indexes directly on path expressions.

Click to flip back

Question

What does JSON_ARRAYAGG do?

Click or press Enter to reveal answer

Answer

JSON_ARRAYAGG is an aggregate function that collapses multiple rows into a single JSON array. Like SUM or STRING_AGG, it works with GROUP BY. It is essential for converting relational query results into JSON format for API responses or AI model input.

Click to flip back

Knowledge Check

Priya at Vault Bank receives customer KYC (Know Your Customer) data from an external API as JSON arrays. She needs to insert this data into a relational Customers table. Which function should she use to convert the JSON array into rows?

Knowledge Check

Leo at SearchWave needs to generate a JSON API response that includes each product with its reviews nested as a JSON array. Which approach produces the correct nested JSON?

Next up: Programmability: Views, Functions, Procedures, and Triggers — create reusable database objects that encapsulate logic.