Deep Dives | Towards Data Science

Building an Evaluation Harness for Production AI Agents: A 12-Metric Framework From 100+ Deployments

Agentic AI

A 12-metric evaluation framework for production AI agents — covering retrieval, generation, agent behavior, and…

Pratik R

May 13, 2026

19 min read

Hybrid Search and Re-Ranking in Production RAG

Large Language Models

When semantic search isn’t enough for RAG

Priyansh Bhardwaj

May 12, 2026

16 min read

Batch or Stream? The Eternal Data Processing Dilemma

Data Engineering

“Should we process our data in batches or in real-time?” It’s not batch vs. stream:…

Nikola Ilic

May 10, 2026

14 min read

LLM Summarizers Skip the Identification Step

LLM Applications

A practitioner’s argument that meeting summarizers fail in the same way regressions fail when you…

William Gieng

May 10, 2026

15 min read

RAG Is Blind to Time — I Built a Temporal Layer to Fix It in Production

Large Language Models

Three weeks into testing, a learner told me my AI tutor gave her the wrong…

Emmimal P Alexander

May 9, 2026

24 min read

When Customers Churn at Renewal: Was It the Price or the Project?

Data Science

A practitioner’s guide to causal attribution when two churn drivers arrive at once.

William Gieng

May 8, 2026

14 min read

The Joy of Typing

Programming

A practical guide to modern type annotations in Python for data science

David Conneely

May 7, 2026

18 min read

Timer-XL: A Long-Context Foundation Model for Time-Series Forecasting

Deep Learning

Exploring the inner workings of a decoder-only Transformer foundation model

Nikos Kafritsas

May 6, 2026

14 min read

Discrete Time-To-Event Modeling – Predicting When Something Will Happen

Data Science

Part 1: The basics — discretization of time, censoring and the life table

Jarom Hulet

May 5, 2026

11 min read

Surviving High Uncertainty in Logistics with MARL

Reinforcement Learning

Part 2. Building scale-invariant agents that seamlessly change contexts

Alexander Levin

May 5, 2026

11 min read