Who will be speaking at Data Day 2015?
Data Day 2014 brought an incredible group of speakers in the data space to Austin. Among those speaking and leading workshops were Michael Berthold, Gary Dusbabek, Christopher Johnson, Russell Jurney, Steve Kramer, Eric Lubow, Charity Majors, Paco Nathan, Sam Richie, Matthew Russell, Eric Sammer, Joe Stein, Eric Tschetter, and Josh Wills.
We are currently in the process of selecting speakers for 2015. Below are some of the speakers who have already confirmed for talks and workshops. If you would like to speak at Data Day, the Instructions for submitting a proposal are on our proposals page.
Confirmed speakers for Data Day Texas 2015
Michael Berthold (Zürich, Switzerland)
One of the most requested speakers at last year's Data Day,
While in town, Michael will also be leading full-day beginning and advanced workshops in KNIME.
Oscar Boykin (San Francisco)
Comfortable in Java, C++, C, C#, Python, Haskell, Scala and many other programming languages, Oscar has an extensive background developing both computational science codes as well as systems programming experience with large scale distributed systems. He also has a strong statistics, probability and information theory background which he applies to machine learning and data-mining problems.
Julia Evans (Montreal)
Chris Fregly (San Francisco)
Chris Johnson (NYC)
Chris Johnson is a machine learning engineer at Spotify where he hacks on music data, builds the best music recommendation system on the planet, and feeds multiple terabytes of data to Hadoop every day. Chris's toolchest includes Python, Numpy, Scikit-Learn, Hadoop, Hive, Java, Cassandra, and Storm.
As both a researcher and an engineer Chris is interested in problems of high dimension and efficient methods of scaling learning under the presence of massive data sets. He is particularly interested in the scalability, design, and architecture decisions that arise within real-time recommender systems such as music recommendation. His research has been featured at premier Machine Learning conferences including NIPS and AISTATS.
In his free time Chris is an avid rock climber, photographer, and music lover who enjoys traveling across the world to remote climbing destinations, experiencing music from a multitude of cultures, and snapping a corpus of photos along the way.
Chris holds MS and BS degrees from UT Austin.
UT CS Webpage
Matthew Kirk (Seattle)
Matthew is the author of the upcoming O'Reilly book, Thoughtful Machine Learning.
Nathan Marz (SF Bay)
Nathan created the Storm and Cascalog projects and has many other projects on his GitHub page. His projects are relied upon by over 50 companies around the world, including Yahoo!, Twitter, Groupon, The Weather Channel, Taobao, and many others.
Nathan is working on a book called Big Data: principles and best practices of scalable realtime data systems for Manning Publications.
Paco Nathan (Mountain View)
Paco Nathan is a "player/coach" who has led innovative Data teams building large-scale apps for several years. Expertise in distributed systems, machine learning, cloud computing, functional programming. Paco is an O'Reilly author, and currently an open source evangelist for Apache Spark with Databricks and an advisor for Amplify Partners. Paco's current interests include Enterprise data workflows, math literacy among execs, and the intersection Ag+Data. He received his BS Math Sci and MS Comp Sci degrees from Stanford University, and has 30+ years technology industry experience ranging from Bell Labs to early-stage start-ups. Most recently, Paco published his Just Enough Math video for O'Reilly Media.
While Paco is in town, he will be offering multiple full day training sessions through GeekAustin. Details to follow.
Matthew Russell (Nashville)
Matthew is Chief Technology Officer at Digital Reasoning, Principal at Zaffra, and author of several books on technology including Mining the Social Web (O'Reilly, 2013), now in its second edition. He is passionate about open source software development, data mining, and creating technology to amplify human intelligence. Matthew studied computer science and jumped out of airplanes at the United States Air Force Academy. When not solving hard problems, he enjoys practicing Bikram Hot Yoga, CrossFitting and participating in triathlons..
Eric Sammer (San Francisco)
Eric is a committer on the Apache Flume and MRUnit projects and the creator of the Kite open source project. Prior to Cloudera, Eric served as a Senior Engineer and Architect at several large scale data driven organizations including Experian and Conductor. Eric is the author of Hadoop Operations published by O’Reilly Media. He speaks frequently on technology and techniques for large scale data processing, integration, and system management.. Eric is author of Hadoop Operations, published by O'Reilly.
Dean Wampler (Chicago)
Website / Blog
Highlights from 2014
Above: Charity Majors, Production Engineering Manager at Parse/Facebook, discussing how she gets mongo, cassandra, mysql, redis, and hive to play together in a heterogenous environment.