Laid off? In between positions? Take advantage of our Open-To-Work discount.
Who will speak at Data Day Texas
We are continuing to announce confirmed speakers. However, speaking proposals are now closed. For the latest news, follow us on Linkedin.
Lena Hall (Seattle)

Lena will be presenting the Saturday Data Day Texas session:
Context > Prompts: Context Engineering Deep Dive
Shachar Meir (London)

Alexandra Pasi (Salt Lake City)

Alexandra will be presenting the following Saturday Data Day Texas session:
Learning Beyond Language: A New Geometric Paradigm for Better ML
Ole Olesen-Bagneux (Copenhagen) @olesenbagneux
Ole Olesen-Bagneux (Linkedin) rethinks data and tech by providing perspectives from Library and Information Science. He holds a PhD in Information Science from the University of Copenhagen, Denmark, where he lectured in courses pivotal for data cataloging, such as Knowledge Organization and Information Retrieval. Ole is author of The Enterprise Data Catalog (O’Reilly). Ole is also author of the upcoming Fundamentals of Metadata Management (O'Reilly, 2025), in which he introduces a completely new architecture for metadata that he calls the Meta Grid. Standing on the shoulders of microservices, which liberated operational data, and data mesh, which liberated analytical data, the Meta Grid aims to liberate metadata. Follow Ole on Medium, and learn more about Meta Grid at Searching For Data.
Ole will be presenting the following Data Day Texas session:
Letter from a fish: Computing, objectivity and the limits of AI. A response to Bill Inmon
Kierra Dotson (Austin)

As the CEO and founder of The Data Bloq, Kierra provides cutting-edge data and GenAI strategy consulting, empowering organizations to harness the full potential of their data assets. Her impact extends far beyond corporate environments; Kierra has conducted over 150 tech consultations and resume revamps via The Data Bloq, helping professionals secure over $250,000 in salary increases. Follow Kierra at The Data Conversationalist.
Hannes Mühleisen (Amsterdam)

Clair Sullivan (Breckenridge, Colorado) @cjlovesdata1

Clair will be presenting the following Saturday Data Day Texas session:
Your Skills, Your Business: Layoff-Proof Your Career through Solopreneurship
Clair will also lead the following Sunday Workshop:
Build Your Solopreneur Roadmap: A Workshop for Data Professionals
Data Governance Keynote
Winfried Etzel (Stavanger, Norway)
An increasingly prominent voice in the global data community, Winfried Etzel champions Data Governance, Data Strategy, and organizational design. Through his #MetaDAMA podcast and his work building the Nordic data community, Winfried drives professional development across the region. His upcoming book, «Data Governance in the Wild», explores how Data Governance must evolve for distributed landscapes, automation, and AI.
Catch Winfried's insights on the Data Democracy Podcast, Catalog and Cocktails, and his A Journey around the World of Data with Joe Reis.
Winfried will be presenting the following session:
Existence over Essence? Data Governance in times of AI
Ontology Keynote
Jessica Talisman (Santa Cruz) @jtalisman

Jessica will be presenting on the Ontology Pipeline.
MF Joe Reis (Salt Lake City) @joereis

Abi Aryan (Lisbon)

Bill Inmon (Castle Rock, Colorado)

Thais Cooke (Raleigh-Durham-Chapel Hill)

Thais has shared insights on popular industry podcasts including Data Podcast for Nerds!, Mavens of Data, and How to Get an Analytics Job, and has been featured as a speaker at MDS Fest. Thais is dedicated to making data accessible, focusing on practical solutions that connect technical concepts with real-world applications. When not immersed in datasets and visualizations, Thais can be found buried in books, writing, experimenting with new recipes in the kitchen, or enjoying family time.
Thais will be presenting the following Saturday Data Day Texas session:
Andrew Madson (Phoenix)

Andrew will be presenting the following Saturday Data Day Texas session:
Iceberg for Agents - Elevating Lakehouse Data Into AI-Ready Context
Sarah McKenna (NYC)

A Georgetown University graduate, Sarah began her career in finance before pivoting to technology, where she developed deep expertise in quality assurance and large-scale, data-driven automated operations. Her leadership has been instrumental in Sequentum's evolution from a web scraping tool provider to a comprehensive data solutions company, culminating in the recent launch of Sequentum Cloud, their next-generation PaaS platform for web data extraction.
Sarah has guided several notable data technology companies to successful acquisitions, including Summit Systems (acquired by Misys), Vitech (acquired by CVC Capital Partners), and Massive Incorporated (acquired by Microsoft). Her focus on establishing ethical standards and best practices in web data collection has helped shape the rapidly evolving alternative data marketplace.
Jordan Morrow (Salt Lake City)

Outside of his work in data, Jordan is married with 5 kids. Jordan loves fitness and has run multiple ultramarathons. He loves to travel with his wife and family. Jordan loves to read, often reading (or using Audible) to go through multiple books at a time. Jordan is the author of Be Data Literate, Be Data Driven, Be Data Analytical, and Business 101 for the Data Professional, published in December 2024.
Jenna Jordan (Asheville)

During her time as a senior consultant at Analytics8, Jenna developed particular expertise in dbt Mesh architecture and the governance strategies that accompany it. Her experience working with dbt Mesh led to a peer exchange session at Coalesce 2024, where she helped attendees explore governance challenges through a role-playing simulation game. Jenna spearheaded the adoption of dbt at the City of Boston Analytics Team, where she architected and built the project from scratch and reorganized the data warehouse.
An occasional blogger on topics like analytical data warehouses and data engineering best practices, Jenna is also a passionate community builder and founder of the City Analytics Exchange, a network for data analytics practitioners in local government. When not transforming data, she's a knitter, board gamer, and dog mom.
David Hughes (Seattle)

David will be presenting the following session:
Observability, Evaluation, and Guardrails for Self-Optimizing Agents
Matthew Mullins (Raleigh-Durham-Chapel Hill)

Matthew will be presenting the following session:
DataOps Is Culture, Not a Toolchain
Chris Brousseau (Salt Lake City)

Russell Spitzer (New Orleans)

Paul Blankley (Denver)
Paul Blankley (Linkedin) is Co-founder and CTO of Zenlytic. With a master's degree in AI from Harvard, Paul made what he now describes as a wonderfully naïve decision: to build a business intelligence tool - unaware that among investors, BI tools share the same "never build this" reputation as ERP systems.
Paul and co-founder Ryan Janssen started building Zenlytic in 2020, before ChatGPT existed, betting that large language models represented a fundamental platform shift for BI. While working as a data consultant setting up data stacks, Paul noticed that the questions that really mattered were never the ones that fit neatly on dashboards - they were the ones that required emailing a human analyst. He set out to solve that problem by creating an AI that could genuinely understand business context and answer questions like a mid-to-senior level data analyst.
As LLM capabilities exploded in late 2022, Zenlytic's architecture, built from the ground up for conversational interaction, positioned the company at the forefront of agentic BI. The company has since raised a $9M Series A led by M13, with backing from Bain Capital Ventures and others.
Paul approaches AI with a healthy skepticism shaped by teaching his 12-year-old son to use ChatGPT for math homework—and watching convincing but mathematically wrong answers emerge. His philosophy: "If you don't understand the process, LLMs are just gasoline. You'll go fast somewhere, but whether that's where you should be going is up to you." That conviction in human-in-the-loop design runs throughout Zenlytic's architecture.
Beyond his technical work, Paul has created AI-generated cubist art and previously held R&D positions at Roche and Zimmer Biomet. When not building the future of business intelligence, he's rock climbing and snowboarding in the mountains around Denver, Colorado. Check out Paul's appearance on The Joe Reis Show.
Matthew Sharp (Salt Lake City)
Co-author of LLMs in Production (Manning), Matthew Sharp (Linkedin) is a seasoned expert in the world of machine learning and artificial intelligence. With over 10 years of experience, Matthew has worked across the entire ML/AI spectrum—from data science to MLOps infrastructure—deploying models to production and building the tools and platforms to support them. Currently, Matthew is an AI Engineer at Flexion, where he leads the advancement of Flexion’s AI application development, with a focus on Generative AI. He also teaches a graduate-level course on the development, deployment, optimization, and real-world applications of large language models (LLMs) at Utah State University. Matthew will be appearing as part of the #SLCdata invasion.
Matthew will be presenting the following Data Day Texas session:
How to Hack An Agent in 10 Prompts: and other true stories from 2025
Dipankar Mazumdar (Toronto) @Dipankartnt
Dipankar Mazumdar (Linkedin / GitHub) is currently the Director of Developer Advocacy at Cloudera, where he leads worldwide developer initiatives focused on lakehouse and Generative AI. Before this, he served in developer advocacy roles at Dremio, Onehouse, and Qlik, where he contributed to open-source projects such as Apache Iceberg, Apache Hudi & XTable among others. For most of his career, Dipankar has worked at the intersection of Data Engineering and AI. He is also currently authoring the book "Engineering Lakehouses with Open Table Formats." Dipankar has been a speaker at numerous conferences such as Databricks' Data+AI, ApacheCon, and Data Day Texas, among others.
Sanjeev Mohan (San Francisco)

Vaibhav Gupta (Seattle)
Across nearly a decade in software engineering, Vaibhav Gupta (Linkedin) has built predictive pipelines at D. E. Shaw, Augmented Reality systems at Google, and real-time 3D reconstruction at Microsoft HoloLens. His tenure at Google included leading performance optimizations for ARCore and the Pixel 4 face unlock, along with significant contributions to depth algorithms for the Pixel Visual Core. He also founded LifePlusPlus, a computer science bootcamp for non-traditional tech hires.
Currently, Vaibhav is CEO and Co-Founder of Boundary (YC W23), creator of BAML - the first domain-specific programming language designed specifically for structured data extraction from LLMs. BAML achieves state-of-the-art results in function-calling with GPT 3.5 over all other models and techniques, including OpenAI's new strict structured outputs. What makes BAML revolutionary is its Schema-Aligned Parsing (SAP) algorithm - instead of constraining LLM outputs upfront (which often fails), BAML fixes broken JSON like trailing commas, unquoted keys, unescaped quotes, new lines, and even fractions in milliseconds post-generation, making cheaper models perform like expensive ones.
Vaibhav holds a BS in Computer Science and Electrical Engineering from UT Austin.
Data Visualization Keynote
Christian Miles (Vancouver Island)
Well known for his widely-read graph visualization newsletter source/target (2021-2024), Christian Miles (Linkedin) specializes in graph database visualization and analytics. Since completing his Masters in Mathematics and Computer Science from the University of Bristol, his work has spanned fraud detection, cybersecurity, and law enforcement at BAE Systems, Wynyard Group, and Cambridge Intelligence.
Christian recently joined the red-hot graph visualization company G.V() - answering the question many in the graph community had been asking (where's Christian going?). Christian is a big picture guy with a reputation for simplifying the complex. If your work intersects with either graph or visualization - don't just attend his talk. Reach out and schedule time to meet with him in person.
Christian will be presenting the Saturday Data Visualization Keynote:
"Who Needs a Chart When You Can Just Chat?" - the role of data visualization in a post-LLM world
Glauber Costa (Dallas)
Glauber Costa (Linkedin) first spoke at Data Day Texas in 2018, and people have been asking us to invite him back ever since. Currently the founder and CEO of Turso, Glauber is leading the effort to build the next evolution of SQLite - in Rust.
Before founding Turso, Glauber spent over 20 years in systems programming: as a Staff Software Engineer at Datadog (where he authored the Glommio Rust async executor), Distinguished Engineer at ScyllaDB (designing core database features during its evolution from concept to production-grade Cassandra alternative), and as a core contributor to the Linux Kernel at Red Hat, focusing on virtualization, storage, and containers.
At Turso, Glauber also leads the development of libSQL—an open-contribution fork of SQLite with 12,000+ GitHub stars—and its namesake database, Turso, a complete rewrite of SQLite in Rust with deterministic simulation testing built in from the ground up. Turso powers production workloads for Astro DB, Val.town, and installations like U2's immersive experience at The Sphere in Las Vegas. The company has raised $7M from Norwest Venture Partners and is backed by a roster of infrastructure-focused investors.
Glauber is known for making complex distributed systems concepts accessible and for his pragmatic approach to evolving foundational technology without breaking compatibility.
Glauber will be presenting the following Data Day Texas session:
We're Rewriting SQLite in Rust. Here's Why That's Not Crazy.
Trey Blalock (Portland)
A highly respected Chief Information Security Officer and security researcher, Trey Blalock (Linkedin) has performed extensive work in almost every security domain for some of the world's largest corporations and governments. Trey has trained thousands of people on advanced security topics, and has taught security classes at many organizations - including AT&T, BCBS, BECU, CIA, CISA, DHS, DIA, FBI, IBM, NSA, RCMP, T-Mobile, U.S. Air Force, U.S. Army, U.S. Marines, U.S. Navy, and U.S. Secret Service. He has served as a Computer Forensic Expert Witness for the U.S. Department of Justice on multiple cases, including handling all aspects of computer forensics on some high-profile cases such as "Donald Vance vs. Donald Rumsfeld," "John Doe vs. Donald Rumsfeld" and "American Boat Company vs. United States."
As Chief Information Security Officer for Coinstar, Trey managed several teams across multiple projects during a major overhaul of the company's infrastructure, architecting significant changes to protect over 25,000 kiosks and data operations on several cloud platforms, reducing the attack surface by more than 95%.
Trey also specializes in defending large-scale systems from advanced threat actors, and currently serves on several forensic, red teaming, and penetration testing advisory boards. Through his consulting practice, Verification Labs, Trey has managed hundreds of security events for companies, including dozens of ransomware events, security breaches, denial-of-service attacks, and over one hundred forensic incidents.
Prashanth Rao (Toronto)

In recent years, Prashanth has worked on a variety of data engineering, data science, and machine learning problems and has thought deeply about databases and data modeling paradigms. He has two master's degrees: one in Aerospace engineering from the University of Michigan, and another in Computer Science from Simon Fraser University in Vancouver. Prashanth’s primary interests include Natural Language Processing (NLP), information extraction, graph theory and database systems. In his spare time, Prashanth enjoys hiking, biking, trying out new cuisines, engaging with the AI developer community, and blogging about all things data at thedataquarry.com. Check out his most recent blog post, Why I'm excited to work at LanceDB.
Weimo Liu (Sunnyvale)

Weimo will be hosting a PuppyGraph: Ask me Anything session.
Hydra Keynote
Joshua Shinavier (San Francisco) @joshsh

As one of the co-founders of what is now Apache TinkerPop, Josh contributed to the 1) first common APIs for graph databases, 2) the original TinkerPop query language which influenced Gremlin, and 3) the first tools which aligned the property graph and RDF data models, starting with neo4j-rdf-sail in 2008.
While at Uber, Josh led the company-wide effort to unify data models and schemas across RPC, streaming, and storage. The scope of this effort included developing standardized schemas, propagating standardized schemas throughout the company's infrastructure, developing mappings to integrate data across languages and environments, and getting as much as possible of Uber's data connected in the form of a graph of entities and relationships, facilitating data discovery and automated query planning.
Over the last decade, Josh has been working on what has finally emerged as Hydra. To get a glimpse into Hydra's early evolution, check out the videos of Josh's 2019 Data Day Texas session A Graph is a Graph is a Graph and his 2020 Data Day Texas session TinkerPop 2020, and the slides from his Data Day Texas 2022 session: Transpilers Gone Wild: Introducing Hydra.
Arthur Bigeard (Glasgow, Scotland)

And so G.V() was born.
Fast forward to the present, and G.V() currently supports Neo4j, Amazon Neptune, Spanner Graph, Dgraph, Kuzu, PuppyGraph, Memgraph, and adds support for additional graph databases almost weekly.
Mark Freeman (Sacramento)
Mark Freeman (Linkedin) is a data scientist turned data engineer with a deep obsession for data quality. As the Tech Lead at Gable, Mark builds internal systems and data products that drive go-to-market strategies, leveraging his extensive experience in creating robust, scalable data solutions. He is also the first employee at Gable where he aims to help bring a data contract solution to market. Mark is co-author of the upcoming O’Reilly book: Data Contracts, in which he shares insights and best practices on ensuring reliable, high-quality data flows within organizations. With a passion for turning complex data challenges into actionable solutions, Mark is committed to advancing the field of data engineering and fostering a culture of trust in data across the industry. Check out Mark's courses on Linkedin Learning.
Shane Gibson (London)

Jon Haddad (Redondo Beach)

Apache Iceberg Keynote
Tim Berglund (Mountain View) @tlberglund
Jans Aasman (SF Bay)
Jans Aasman (Wikipedia / LinkedIn) is a Ph.D. psychologist and expert in Cognitive Science - as well as CEO of Franz Inc., an early innovator in Artificial Intelligence and provider of the graph database, AllegroGraph. As both a scientist and CEO, Dr. Aasman continues to break ground in the areas of Artificial Intelligence and Knowledge Graphs as he works hand-in-hand with numerous Fortune 500 organizations as well as US and foreign governments. Dr. Aasman spent a large part of his professional life in telecommunications research, specializing in applied Artificial Intelligence projects and intelligent user interfaces. He gathered patents in the areas of speech technology, multimodal user interaction, and recommendation engines while developing precursor technology for tablets and personal assistants. He was also a professor in the Industrial Design department of the Technical University of Delft. Dr. Aasman is a noted conference speaker at such events as Smart Data, NoSQL Now, International Semantic Web Conference, GeoWeb, AAAI, Enterprise Data World, Text Analytics, and TTI Vanguard, to name a few.
Jonathan Ellis (Austin) @spyced

Seeing the promise of coding with AI while building JVector, and experiencing the frustration of seeing the same tools completely fail with Cassandra (a 10x larger codebase), Jonathan was motivated to create Brokk - a tool to tame large codebases for AI.
Adriano Vlad-Starrabba (London)

Juan Sequeda (Austin) @juansequeda

Juan has researched and developed technology on semantic data virtualization, graph data modeling, schema mapping and data integration methodologies. He pioneered technology to construct knowledge graphs from relational databases, resulting in W3C standards, research awards, patents, software and his startup Capsenta, acquired by data.world in 2019. Juan strives to build bridges between academia and industry as former co-chair of the LDBC Property Graph Schema Working Group, member of the LDBC Graph Query Languages task force, and standards editor at the World Wide Web Consortium (W3C). Juan continues to be an active member of the scientific community through academic research partnerships, advising students, and serving as a member of data and AI scientific conference committees.
Alex Merced (Winter Park, FL) @alexmerced
Alex Merced (Linkedin) has a history of creating content to enable developers of all types through his personal projects like DevNursery.com, The Web Dev 101 Podcast, and the DataNation podcast. Currently Head of DevRel at Dremio, Alex has held positions with companies like Crossfield Digital, CampusGuard, GenEd Systems and others along with being an Instructor for General Assembly Bootcamps. Alex is co-author of Apache Iceberg: The Definitive Guide and the upcoming Apache Polaris: The Definitive Guide, both from O'Reilly.
Jean-Georges Perrin (Albany, New York) @jgp

Previously Principal Enterprise Data Architect at Expedia and Group Intelligence Platform Lead at PayPal, Jean-Georges is currently Senior Product Manager at Actian. In addition, Jean-Georges is Chair of the Technical Steering Committee for Bitol, which works toward an Open Data Contract Standard.
Jean-Georges is author of Spark in Action from Manning, and co-author of Implementing Data Mesh from O'Reilly. Check out his thoughts on Data Mesh on YouTube.
Jean-Georges will be presenting the following Saturday Data Day Texas session:
Hands-on Data Product: let's build a data product in 30 minutes
Matthew Housley (Salt Lake City)

Jonathan Mugan (Austin)
Jonathan Mugan (Linkedin), Principal Scientist at De Umbra, is a researcher specializing in artificial intelligence, machine learning, and natural language processing. His current research focuses on deep learning for natural language generation and understanding. Dr. Mugan received his Ph.D. in Computer Science from the University of Texas at Austin. His thesis centered on developmental robotics, an area of research that seeks to understand how robots can learn about the world in the same way that human children do. Dr. Mugan also held a post-doctoral position at Carnegie Mellon University, where he worked at the intersection of machine learning and human-computer interaction. One of the most requested speakers at the Data Day Texas conferences, he recently also spoke on the topic of NLP at the O’Reilly AI conference, and is the creator of the O’Reilly video course Natural Language Text Processing with Python. Dr. Mugan is also the author of The Curiosity Cycle: Preparing Your Child for the Ongoing Technological Explosion.
Jonathan will be presenting the following Data Day Texas session:
LLMs Expand Computer Programs by Adding Judgment