DAMA Puget Sound - March Chapter Meeting

  • March 10, 2026
  • 5:30 PM - 7:00 PM
  • Virtual and In-Person (Redmond, WA)


Registration is closed

DAMA PUGET SOUND - CHAPTER MEETING

Tuesday, March 10, 2026 

Social / Networking: 5:30 - 6:00 pm / Presentation: 6:00 - 7:00 pm

Hybrid Meeting (Virtual and In-Person - Redmond, WA)




Guest Speaker: Arun Vivek Supramanian

From SQL to Semantics: Cost-based Optimization for LLM-Powered Database Operators


This session is designed for data architects, database administrators, data engineers, and analytics professionals who are interested in the future of database systems and the practical application of LLMs in enterprise data management.

Abstract
Modern data systems increasingly expose semantic operators, powerful extensions to the relational algebra such as sem_filter and sem_join, that evaluate natural-language predicates over large tables using Large Language Models (LLMs). While these operators unlock unprecedented analytical capabilities, they come at a significant cost: every LLM call costs money ($0.001–$0.10) and adds latency (0.5–5 seconds), making naive execution prohibitively expensive at scale. This talk introduces the emerging field of semantic query optimization, which aims to answer a critical question for data practitioners: how can we execute these powerful semantic queries while staying within a strict budget and meeting a specific quality target? We will explore the core architectural pattern for cost-based semantic optimization, using real-world examples and findings from the research paper.
Key takeaways for attendees will include:

    1. Clear Definition of Semantic Operators: What are they, how do they extend traditional SQL, and what new analytical questions can they answer?
    2. The Cost-Quality Tradeoff: A practical framework for understanding and quantifying the tradeoff between query cost, latency, and accuracy.
    3. Cost-based Planning with Conformal Cascades: An introduction to the core optimization technique, where a cheap “proxy” model handles easy rows and a powerful “oracle” model handles ambiguous cases, all while providing probabilistic quality guarantees.
    4. Real-World Case Studies: We will examine results from live experiments on fact-checking (FEVER), bioinformatics (BioDEX), and legal contract analysis (CUAD), demonstrating how cost-based optimization can reduce query costs by over 80% while maintaining high recall.
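
The proxy/oracle cascade in takeaway 3 can be illustrated with a minimal Python sketch. It is not the speaker's implementation: the names `cascade_filter`, `toy_proxy`, and `toy_oracle` are hypothetical, the two "models" are keyword-matching stand-ins rather than LLM calls, and the thresholds are fixed by hand. A conformal cascade, as described in the abstract, would instead calibrate the thresholds on labeled data so that a quality target holds with a stated probability.

```python
from typing import Callable, Iterable, List, Tuple


def cascade_filter(
    rows: Iterable[str],
    proxy_score: Callable[[str], float],   # cheap model: confidence in [0, 1]
    oracle_label: Callable[[str], bool],   # expensive model: final yes/no
    low: float,
    high: float,
) -> Tuple[List[str], int]:
    """Return (rows that pass the predicate, number of oracle calls)."""
    if not 0.0 <= low <= high <= 1.0:
        raise ValueError("expected 0 <= low <= high <= 1")
    kept: List[str] = []
    oracle_calls = 0
    for row in rows:
        score = proxy_score(row)
        if score >= high:
            # Proxy is confident the row matches: accept without the oracle.
            kept.append(row)
        elif score <= low:
            # Proxy is confident the row does not match: reject cheaply.
            continue
        else:
            # Ambiguous row: pay for one oracle call.
            oracle_calls += 1
            if oracle_label(row):
                kept.append(row)
    return kept, oracle_calls


# Hypothetical stand-ins for the two models (assumptions, not real LLM calls).
def toy_proxy(row: str) -> float:
    if "refund" in row:
        return 0.9
    return 0.5 if "order" in row else 0.1


def toy_oracle(row: str) -> bool:
    return "cancel" in row or "refund" in row


rows = ["refund please", "where is my order", "cancel my order", "great product"]
kept, calls = cascade_filter(rows, toy_proxy, toy_oracle, low=0.2, high=0.8)
print(kept, calls)
```

In this toy run only the two ambiguous rows reach the oracle, so half of the expensive calls are avoided. Widening the gap between `low` and `high` sends more rows to the oracle, trading cost for accuracy.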

    Speaker Bio:

    Arun is a seasoned data engineering professional with over 15 years of experience building real-time data platforms and large-scale machine learning systems at companies including Amazon and Perion. He led foundational initiatives such as the supply funnel for Prime Video Ads and a conversational analytics platform for Ads Monetization. Passionate about self-serve analytics, he focuses on designing production-grade architectures that democratize access to complex data. His work bridges massive-scale engineering and intelligent system design, driving multimillion-dollar business impact while enabling autonomous, data-informed workflows. Today, he is working on evolving modern data platforms to support the next generation of AI agents.


    © 2025 Data Management Association of Puget Sound (DAMA-PS) | Affiliated Chapter of DAMA International
