Blog | Siva Nekkanti

May 4, 2026

Where Data Engineering Meets AI Product Engineering

My current view on the overlap between reliable data platforms and useful AI products.

Feb 2, 2026

Lessons From Reliable Data Systems

A synthesis of what years of pipeline, privacy, and ML infrastructure work have taught me.

Nov 3, 2025

Data Engineering for Agentic Products

How tool-using AI products raise the bar for logs, permissions, and evaluation data.

Aug 4, 2025

Building Trustworthy Feature Stores

What feature infrastructure needs beyond a place to store feature values.

May 5, 2025

The Hard Part of Reliable AI

Why reliable AI products depend on ordinary engineering discipline around changing systems.

Feb 3, 2025

Measuring Retrieval Quality

A deeper look at evaluating the data layer behind LLM product behavior.

Nov 4, 2024

From Pipelines to Platforms

How my perspective shifted from building individual workflows to enabling teams.

Aug 5, 2024

AI Systems Need Data Contracts Too

Why prompts, retrieval context, and evaluation data need explicit interfaces.

May 6, 2024

Designing for Data Minimization

How privacy constraints can lead to cleaner, more purposeful data architecture.

Feb 5, 2024

Reliable Data Systems Have Memory

A more mature view of run history, decisions, and institutional knowledge.

Nov 6, 2023

Privacy in AI Systems

Why AI products make data minimization and permission boundaries even more important.

Aug 7, 2023

Evaluating LLM Products

A practical view of evaluation data, feedback loops, and product quality.

May 8, 2023

Retrieval Is Data Engineering

How LLM applications made familiar data quality problems show up in a new interface.

Feb 6, 2023

Data Platforms Need Product Thinking

Why internal platforms should be designed around workflows, not just capabilities.

Nov 7, 2022

Cost Is an Observability Signal

How cloud spend helped me see inefficient data systems before they became incidents.

Aug 8, 2022

Deleting Data Is Engineering

Why retention, deletion, and lifecycle controls deserve real design attention.

May 9, 2022

Serving Features Reliably

A note on the gap between offline feature logic and production serving expectations.

Feb 7, 2022

ML Infrastructure Is a Feedback System

Why I started thinking beyond training pipelines and toward learning loops.

Nov 8, 2021

Testing Transformations Without Pretending Data Is Code

A more practical view of testing analytics and pipeline logic.

Aug 9, 2021

Lineage as a Debugging Tool

How lineage became more useful to me when I stopped treating it as a catalog feature.

May 10, 2021

Privacy Is a Systems Property

Why privacy engineering belongs in architecture decisions, not only review checklists.

Feb 8, 2021

Feature Pipelines Are Data Products

How ML feature work changed the way I thought about ownership and interfaces.

Nov 9, 2020

Observability for Humans

A reflection on alerts, dashboards, and making data systems easier to operate.

Aug 10, 2020

Data Contracts Before They Were Fashionable

How producer-consumer expectations became a recurring theme in my data engineering work.

May 11, 2020

Idempotency as Calm

Why rerunnable jobs made data operations feel less dramatic.

Feb 17, 2020

Reliability During Uncertainty

How changing business conditions made me think harder about freshness, drift, and operational signals.

Nov 18, 2019

Documenting the Weird Parts

Why the most useful data documentation often explains exceptions, not happy paths.

Aug 12, 2019

Partitions and Practical Performance

A note on learning to make warehouse performance understandable instead of mysterious.

May 13, 2019

Backfills Taught Me Humility

What historical data corrections revealed about assumptions hidden inside pipelines.

Feb 11, 2019

Data Quality Is Product Quality

How I began connecting backend data issues to the product experiences people actually see.

Nov 16, 2018

Batch Jobs Need Owners

A reflection on why scheduled jobs become fragile when nobody clearly owns their behavior.

Aug 14, 2018

Debugging With Row Counts

A simple reliability habit that helped me understand data movement before adding bigger tools.

May 9, 2018

Schemas Are Promises

Why I started treating schemas as contracts between teams, not just database metadata.

Feb 12, 2018

Learning From My First Broken Pipeline

Early notes on why a data pipeline that runs is not always a data pipeline that works.