Who will speak at Data Day Texas 2020

Take advantage of our discount rooms at the conference hotel.


We are beginning to announce speakers for 2020. Want to join us as a speaker? Check out our proposals page.

Confirmed Speakers

Jon Allen (San Francisco)

Jon Allen is a Senior Data Scientist at SyncThink and a Founder of / Stand-up Comedian at Cheaper Than Therapy. Jon is a physicist who studied at UT Austin’s Center for Relativity. After leaving academia, Jon worked with start-ups from MIT’s Media Lab on automated gait analysis and, later, co-founded Ravel in 2010, which specialized in large scale data solutions for corporate marketing groups. Jon moved out to the Bay Area in 2012 and has worked extensively as a data scientist in the medical and hardware spaces. He also started, runs, and regularly performs in one of the largest independent comedy clubs in the US, Cheaper Than Therapy.

Jesse Anderson (Reno) @jessetanderson

Jesse Anderson is a data engineer, creative engineer, and managing director of the Big Data Institute. He works with companies ranging from startups to Fortune 100 companies on Big Data. This includes training on cutting edge technologies like Apache Kafka, Apache Hadoop and Apache Spark. He has taught over 30,000 people the skills to become data engineers. He is widely regarded as an expert in the field and for his novel teaching practices. Jesse is published on O’Reilly and Pragmatic Programmers. He has been covered in prestigious publications such as The Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired. You can learn more about Jesse at Jesse-Anderson.com.

Dave Bechberger (Houston) @bechbd

Dave Bechberger is a Solution Architect in the Graph Practice at DataStax where he helps customers build large distributed graph backed applications. Prior to that he was the Chief Architect at Gene by Gene, a genetic genealogy and bioinformatics company, where he worked to migrate their legacy technology stack to modern technologies including heavy use of graph databases and Cassandra. Dave has spent his career engaging in full stack software development but specializes in building data architectures in complex data domains such as bioinformatics, oil and gas, supply chain management, etc. He uses his knowledge of graph and other big data technologies to build out highly performant and scalable systems. Dave has previously spoken at a variety of national and international technical conferences including NDC Oslo, NDC London, as well as previous GraphDay conferences in Texas, San Francisco and Seattle.

Tim Berglund (Denver) @tlberglund

Tim Berglund is a teacher, author, and technology leader with Confluent, where he serves as the senior director of developer experience. Tim can frequently be found at speaking at conferences internationally and in the United States. He is the co-presenter of various O’Reilly training videos on topics ranging from Git to distributed systems and is the author of Gradle Beyond the Basics. He tweets as @tlberglund, blogs very occasionally at timberglund.com, and is the cohost of the DevRel Radio Podcast. He lives in Littleton, Colorado, with the wife of his youth and their youngest child, the other two having mostly grown up.

Michael Berthold (Konstanz)

Michael Berthold is currently president of KNIME.com AG and co-creator of KNIME (wikipedia entry), the open analytics platform used by thousands of data experts around the world. Since August 2003, Michael has been the Nycomed-Chair for Bioinformatics and Information Mining at Konstanz University, Germany where his research focuses on using machine learning methods for the interactive analysis of large information repositories in the Life Sciences. Previously he held positions in both academia (Carnegie Mellon, UC Berkeley) and industry (Intel, Tripos).
Michael is Past President of the North American Fuzzy Information Processing Society, Associate Editor of several journals and the President of the IEEE System, Man, and Cybernetics Society. He has been involved in the organization of various conferences, most notably the IDA-series of symposia on Intelligent Data Analysis and the conference series on Computational Life Science. Together with David Hand he co-edited the textbook Intelligent Data Analysis: An Introduction which has recently appeared in a completely revised, second edition. He is also co-author of Guide to Intelligent Data Analysis (Springer Verlag) which appeared in summer 2010. When time permits Michael still writes code.

Chris Davis (Dallas) @phoo

Dr. Chris Irwin Davis is a professor of computer science at the University of Texas at Dallas who teaches database theory and design. He also has 15 years of experience working for Fortune 500 companies in data management and software development lifecycle.
Chris will present the following session: Automated Encoding of Knowledge from Unstructured Natural Language Text into a Graph Database


Graham Ganssle (Austin) @grahamganssle

Graham Ganssle (LinkedIn / GitHub) loves data. As Head of Data Science at Expero, his favorite part of work is daydreaming up innovative solutions to quantifiable problems and planning an implementation strategy. Building intelligent systems is his passion whether it’s automated derivatives trading bots, adaptive image processing algorithms, or autonomous musical composers. Whether deep learning is the optimal solution or not, helping customers succeed through solving their analytics problems is where Graham finds the most satisfaction.
Graham Ganssle’s physics Ph.D. focused on digital signal processing, specifically on a (then) new optimization method which used naturally coupled wavefields to stabilize convergence. He also holds a masters degree in applied physics and a professional geoscientist license. Graham worked in the oil and gas vertical for ten years, performing data science and quantitative geophysics for clients around the world. He has numerous publications on a variety of scientific topics and has been awarded both scientific and business achievement awards.
Off the clock, Graham’s an inept aspiring rock climber and a triathlete. He’s constantly imploring his bride (and, more successfully, his puppy) to get muddy with him on the trail. Most Saturday mornings you can find Graham clacking away at his keyboard on his newest experiment or craziest inspiration.
Graham will present the following session: Productionizing Deep Learning in Health Care

Michael Grove (Washington, DC) @mikegrovesoft

Michael Grove is VP of Engineering and co-founder of Stardog where he oversees the development of the Stardog Knowledge Graph Platform. Michael studied Computer Science at the University of Maryland and is an alumnus of its well-regarded MIND Lab which specialized in semantic technologies. Before Stardog, he worked at Fujitsu Resarch on the use of graphs and semantic technologies in pervasive computing environments. Michael is an expert in large scale database and reasoning systems and has worked with graphs and graph databases for nearly fifteen years.

Dr. Denise Koessler Gosnell (Charleston) @DeniseKGosnell

Dr. Denise Gosnell leads a team at DataStax which builds some of the largest, distributed graph applications in the world. Her passion centers on examining, applying, and evangelizing the applications of graph data and complex graph problems. As an NSF Fellow, Dr. Gosnell earned her Ph.D. in Computer Science from the University of Tennessee. Her research coined the concept of "social fingerprinting" by applying graph algorithms to predict user identity from social media interactions.​ ​Since then, Dr. Gosnell has built, published, patented, and spoke on dozens of topics related to graph theory, graph algorithms, graph databases, and applications of graph data across all industry verticals.
Dr. Gosnell was one of the most highly rated speakers at Data Day Texas 2019. We're happy that she will be returning for 2020.

Rob McDaniel (Seattle)

Rob McDaniel is the CTO at Sigma IQ, a company that provides artificial intelligence for corporate finance. . Most recently, Rob was Manager of Applied Science at Rakuten, where he managed the AI that expands the depth and quality of Rakuten's global product catalog.

Rob has a diverse background in engineering and machine learning, both with major corporations and startups. He has worked on problems related to machine translation, taxonomy classification and information extraction, and has a passion for unsupervised methods and graph theory.

Human in the Loop Keynote
Robert Munro (San Francisco ) @WWRob

Robert Munro (LinkedIn) most recetly was Chief Technology Officer at Figure-Eight (formerly known as Crowdflower). Previously, he ran Product for AWS's first Natural Language Processing services in the Deep Learning team at Amazon AI. Robert is an expert in combining Human and Machine Intelligence, working with Machine Learning approaches to Text, Speech, Image and Video Processing. Robert has founded several AI companies, building some of the top teams in Artificial Intelligence. He has worked in many diverse environments, from Sierra Leone, Haiti and the Amazon, to London, Sydney and Silicon Valley, in organizations ranging from startups to the United Nations. Robert has published more than 50 papers, has a PhD from Stanford University.
Rob will be leading an entire track of sessions on Human in the Loop machine learning.

Josh Perryman (Bryan / College Station) @joshperryman

Josh Perryman likes to play with data. Oftentimes this is implementing proprietary algorithms closer to the data for performance or scale. Sometimes it is ad-hoc investigation and analysis, a sort of exploratory querying. A few times he’s been able to leverage his experience with data engines for dramatic performance improvements. But the real joy is designing a schema for both functionality and performance, one which increases the productivity of other developers and enables a technology to solve new problems or deliver new value to the business.
Technology isn't just data, and Josh does more than just play with data. He’s worked with high performance computing (HPC) environments, taking computations from hours to minutes or seconds. He has built visualizations which deliver new insights into complex data domains. He’s managed technology personnel, both directly and indirectly, to deliver technology solutions. Josh has put together more types of technology components, software and hardware, than can be counted, because one of his fortes is solving problems by building sustainable systems. Josh is currently Director of Product Development, Graph Solutions at VeracityID.

Database Keynote
Dr. Marko A. Rodriguez (Santa Fe) @twarko

Dr. Marko A. Rodriguez (LinkedIn) is a graph and stream computing specialist currently focused on designing stream-based virtual machines for processing graph-based structures within distributed computing environments. Marko is the co-founder of Apache TinkerPop where he is developing the next generation TinkerPop4 virtual machine and bytecode specification that will enable the natural integration of any data processor and query language. Marko is also the founder of RReduX which, along with developing TinkerPop4, is designing a universal distributed computer called GMachine. Dr. Rodriguez received his Ph.D. in computer science from the University of California at Santa Cruz and was a Director's Fellow at the Center for Nonlinear Studies at the Los Alamos National Laboratory.

Juan Sequeda (Austin) @juansequeda

Dr. Juan Sequeda is the co-founder of Capsenta, a spin-off from his research, and the Senior Director of Capsenta Labs. He holds a PhD in Computer Science from the University of Texas at Austin. His research interests are on the intersection of Logic and Data and in particular between the Semantic Web and Relational Databases for data integration, ontology based data access and semantic/graph data management. Juan is the recipient of the NSF Graduate Research Fellowship, received 2nd Place in the 2013 Semantic Web Challenge for his work on ConstituteProject.org, Best Student Research Paper at the 2014 International Semantic Web Conference and the 2015 Best Transfer and Innovation Project awarded by Institute for Applied Informatics. Juan is the General Chair of AMW 2018, was the PC chair of the ISWC 2017 In-Use track, is on the Editorial Board of the Journal of Web Semantics, member of multiple program committees (ISWC, ESWC, WWW, AAAI, IJCAI) and co-creator of the Consuming Linked Data Workshop series. Juan is a member of the Graph Query Languages task force of the Linked Data Benchmark Council (LDBC) and has also been an invited expert member and standards editor at the World Wide Web Consortium (W3C).

TinkerPop Keynote
Joshua Shinavier (San Francisco) @joshsh

Joshua Shinavier is a primordial being of the graph database domain, and holds a PhD in Web science from RPI’s Tetherless World Constellation. He contributed to the first common APIs for graph databases, the original TinkerPop query language which influenced Gremlin, and the first tools which aligned the property graph and RDF data models, starting with neo4j-rdf-sail in 2008. Other graphy adventures have include Lisp hacking at Franz Inc. and Java hacking at Aurelius. As of 2017, he is part of the knowledge graph team at Uber, where he also leads a company-wide effort to unify schemas across RPC, streaming, and storage. He feels, now as ever, that the research, business, and open source communities have a lot to learn from each other with respect to graphs and knowledge representation.

Ted Wilmes (Oklahoma City) @trwilmes

Ted Wilmes, Data Architect at Expero, is a graduate of Trinity University where he studied computer science and art history. He started his professional career at a not-for-profit research and development institution where he performed contract software development work for a variety of government and commercial clients. During this time he worked on everything from large enterprise systems to smaller, cutting edge research and development projects. One of the most rewarding parts of each of these projects was the time spent collaborating with the customer.
As Ted’s career continued, he moved on to an oil and gas startup and continued to dig deeper into the data side of software development, gaining an even deeper interest in how databases work and how to eek as much performance out of them as possible. During this time he became interested in the application of graph databases to certain problem sets. Today, at Expero, Ted enjoys putting his deep knowledge of transactional graph computing to work as he helps customers of all types navigate the burgeoning property graph database landscape.
Outside of work, Ted enjoys spending time with his family out-of-doors, listening to and playing loud music, and contributing to the Apache TinkerPop project as a committer and PMC member.

 


Martin Fowler of Thoughtworks holding a "fireside chat" for the Data Day 2019 audience.

 


Perennial Data Day favorite, Holden Karau, presenting the latest on Spark at DDTX19.

 


Jonathon Morgan, CEO of New Knowledge, discussing how to build a data science team, at DDTX19.