Data Duets

Posts

Augmented data science: Human Intent, AI Execution

Duygu Dagli & G.T. Ozer March 09, 2026

Synergy between human intent and AI execution in data science (Image created by Nano Banana 2 based on the post content) tl;dr (never ai;dr) LLMs are now part of data science workflows. The question is no longer if we use them, but how and where . We reviewed emerging research to create a framework defining the synergy between human and machine: human intent followed by (and bounding) LLM execution . Effective AI-assisted workflows successfully decouple intent (the "what" and "why" ) from execution (the "how" ). In this model, the data scientist acts as the orchestrator , while the LLM serves as the execution engine . Success hinges on assigning intent to the correct tasks. The data scientist sets goals and validates assumptions . The LLM searches well-defined spaces and executes code subject to human validation. This post sets the stage for a new series where we hope to introduce a task-lev...

Using generative models, well, to generate data

Duygu Dagli & G.T. Ozer February 16, 2026

Distributions of the real vs. synthetic data for selected variables tl;dr (never ai;dr) One underappreciated use case of generative models is effectively creating realistic tabular datasets that preserve the underlying statistical properties of the original data. Leading libraries for data synthesis include Synthetic Data Vault, YData-Synthetic, and Synthcity. Practical applications include navigating the bottlenecks of sharing sensitive data with vendors or augmenting datasets for rare events, such as product recalls. Ultimately, this approach enables a data-centric workflow even when data is scarce or biased, ensuring models are trained on a high-fidelity representation of reality. Podcast-style summary by NotebookLM Introduction How can we use generative models beyond large language model...

Can GenAI accelerate the adoption of optimization?

Duygu Dagli November 17, 2025

Image courtesy of Andertoons Podcast-style summary by NotebookLM Solo post: Director's cut The title, "Democratizing Optimization with Generative AI," reflects the approach of a recent paper that investigates why businesses fail to adopt advanced optimization models and assesses whether Generative AI (GenAI) can help bridge that gap. 1 This is interesting. The authors argue that Generative AI (GenAI) can: Offer an intuitive layer to provide visibility into the inputs (Insight), Make the model logic and constraints transparent (Interpretability), and Rapidly respond to change and create what-if scenarios (Interactivity and Improvisation). This is called 4I Framework, and the paper provides a proof of concept from Microsoft’s Cloud Supply Chain team. Most of the insights shared in the paper resonated with my own experience building and deploying optimization models. Optimization has always been too opaque for the teams who would benefit from it the most. Why? Beca...

A look back to look forward: Where do new ideas come from?

Duygu Dagli & G.T. Ozer September 08, 2025

Image courtesy of Entertainment Weekly Podcast-style summary by NotebookLM Introduction to the business case This was going to be a solo post, but I bring the Director into the conversation by asking her a question at the end. The inspiration for the article comes from the WSJ piece "Meet the United Airlines Executive Who Gets to Pick Its Hot New Routes" here . Alison Sider, a fellow Longhorn from my alma mater, UT Austin, interviews Patrick Quayle, the senior vice president of global network planning and alliances at United Airlines. She asks interesting questions on a topic of interest. 1 How does United Airlines decide where to fly, and where not to fly? If a route is a proven cash cow, flying it is an easy decision, but of course, all the competitors are already flying it too. Everyone knows the answer; the only thing left is optimization. Finding a successful route no other competitor has yet flown is a more difficult problem because there's no data, and thi...

Using AI methods and computational tools to mitigate tariff uncertainty

Duygu Dagli & G.T. Ozer June 09, 2025

Image courtesy of Tom Fishburne Podcast-style summary by NotebookLM Introduction to the business case This time, we are inspired by two articles that focus on how the recent wave of U.S. tariffs has created significant uncertainty for global supply chains and how AI can help address these challenges. One of the articles is from The Wall Street Journal (WSJ) and the other is from CNBC. You can read the WSJ article "AI Can’t Predict the Impact of Tariffs — but It Will Try" here and the CNBC article "Companies turn to AI to navigate Trump tariff turbulence" here . Sudden tariff increases on imports from China, Canada, and Mexico have forced companies to quickly reevaluate their sourcing strategies, inventory levels, and supplier relationships. Although machine learning methods and AI-powered tools can help companies analyze risk, model tariff scenarios, and optimize logistics, these tools often struggle to predict the impact of abrupt and rare changes in the en...

Measuring long-term outcomes using short-term data and surrogates

Duygu Dagli March 18, 2025

Image courtesy of Cai et al. (2023) Podcast-style summary by NotebookLM Solo post: Director's cut When measuring the outcomes of an intervention, organizations usually observe and quantify immediate or short-term results. For example, marketing could drive additional traffic, a discounted shipping rate could increase conversion rates, a price promotion or a loyalty program could drive sales. In most cases, however, these interventions would have effects that materialize over a longer period of time. After being exposed to a promotion, customers may become more price sensitive and start buying cheaper products or strategically time their purchases to take advantage of the next promotion. In general, companies will not conduct multi-month (or even multi-year) experiments to compare alternatives and find the option that optimizes long-term return on investment (ROI). Decisions must be made in the absence of long-term results. To address this shortcoming, in 2019, Susan Athey et ...

Eat Mor Chikin, and fast? The story of Chick-fil-A’s multimodal data analysis and optimization

Duygu Dagli & G.T. Ozer March 03, 2025

Image courtesy of Imago / Pubity Illustration - pubity.com Podcast-style summary by NotebookLM Introduction to the business case The Wall Street Journal article reveals how Chick-fil-A is using innovative data collection and analysis methods to optimize its drive-through operations. The company has developed a "Film Studies" unit that combines drone footage with security camera data to create comprehensive "game films" of its restaurant operations. This multimodal data collection approach, inspired by NFL game analysis, allows the company to model traffic patterns, identify operational bottlenecks, and analyze service efficiency in drive-through operations. Read the article here . The data-centric insights have led to significant operational improvements, including the development of new restaurant designs with elevated kitchens and multiple drive-through lanes capable of serving 700 cars per hour . The analysis also helped optimize staffing patterns, identi...