Who's speaking at Data Day Texas 2016?

The following speakers have been confirmed for Data Day Texas 2016, with still more to come.

We are still accepting submissions for Data Day 2016. Details can be found on our proposals page.

John Akred (SF Bay)

John Akred is the Founder and CTO of Silicon Valley Data Science. In the business world, John Akred likes to help organizations become more data driven. He has over 15 years of experience in machine learning, predictive modeling, and analytical system architecture. His focus is on the intersection of data science tools and techniques; data transport, processing and storage technologies; and the data management strategy and practices that can unlock data driven capabilities for an organization. A frequent speaker at the O'Reilly Strata Conferences, John is host of the perennially popular workshop: Building A Data Platform.
Twitter: @BigDataAnalysis
John will be giving the following presentation at Data Day Texas: Running Agile Data Science Teams
John will also be hosting office hours at Data Day Texas.

Carl Anderson (NYC)

Carl Anderson is the Director of Data Science at Warby Parker in New York overseeing data engineering, data science, supporting the broader analytics org, and creating a data-driven organization. He has had a broad-ranging career, mostly in scientific computing, covering areas such as healthcare modeling, data compression, robotics, and agent based modeling. He holds a Ph.D. in mathematical biology from the University of Sheffield, UK.
He is the author of "Creating a Data-Driven Organization" (O'Reilly, 2015)
Twitter: @LeapingLlamas
Carl will give the following presentation at Data Day Texas: Creating a Data-Driven Organization
While at Data Day Texas, Carl will also be signing his new book, Creating a Data Driven Organization

Preetha Appan (Austin)

Preetha Appan is the technical lead of the recommendations team at Indeed. Her past contributions to Indeed's job and resume search engines include keyword tokenization improvements, query expansion features, and major infrastructure and performance improvements. She enjoys working on challenging problems in machine learning and information retrieval.



Jason Baldridge (Austin)

Jason Baldridge is co-founder and Chief Scientist of People Pattern. Jason uses his expertise in natural language processing and machine learning to power the platform, turning unstructured social text into structured data. Jason is also Associate Professor of Computational Linguistics at the University of Texas at Austin, and is one of the co-creators of the Apache OpenNLP Toolkit. As a professor, Jason works on probabilistic models for categorization and syntax, with a particular emphasis on low-resource languages. He also focuses on methods and applications for connecting linguistic objects to geography and time. He has been active in the creation and promotion of open source software for natural language processing, including OpenNLP, ScalaNLP, Junto, TextGrounder and OpenCCG. Jason received his Ph.D. from the University of Edinburgh in 2002, where his doctoral dissertation was awarded the 2003 Beth Dissertation Prize from the European Association for Logic, Language and Information. His main academic research interests include categorial grammars, parsing, semi-supervised learning, co-reference resolution and geo-referencing.
Jason Baldridge will be appearing as part of NLP Day.

Michael Berthold (Zürich, Switzerland)

Michael Berthold is currently president of KNIME.com AG and co-creator of KNIME (wikipedia entry), the open analytics platform used by thousands of data experts around the world. Since August 2003, Michael has been the Nycomed-Chair for Bioinformatics and Information Mining at Konstanz University, Germany where his research focuses on using machine learning methods for the interactive analysis of large information repositories in the Life Sciences. Previously he held positions in both academia (Carnegie Mellon, UC Berkeley) and industry (Intel, Tripos).
Michael is Past President of the North American Fuzzy Information Processing Society, Associate Editor of several journals and the President of the IEEE System, Man, and Cybernetics Society. He has been involved in the organization of various conferences, most notably the IDA-series of symposia on Intelligent Data Analysis and the conference series on Computational Life Science. Together with David Hand he co-edited the textbook Intelligent Data Analysis: An Introduction which has recently appeared in a completely revised, second edition. He is also co-author of Guide to Intelligent Data Analysis (Springer Verlag) which appeared in summer 2010. When time permits Michael still writes code.

Lukas Biewald (SF Bay)

Lukas Biewald (Wikipedia) is the founder and CEO of CrowdFlower. Founded in 2007, CrowdFlower provides Labor-on-Demand to help companies outsource high-volume, repetitive tasks to a massively-distributed global workforce.
Before founding CrowdFlower, Lukas was a senior scientist and manager within the Ranking and Management Team at Powerset, Inc., acquired by Microsoft in 2008. He led the Search Relevance Team for Yahoo! Japan after graduating from Stanford University with a B.S. in Mathematics and an M.S. in Computer Science. Recently, Lukas won the Netexplorateur Award for GiveWork – a collaboration with Samasource that brings digital work to refugees worldwide. Lukas is also an expert level Go player.
Twitter: @l2k
While at Data Day Austin, Lukas will also be hosting office hours

Trey Blalock (Seattle)

Trey Blalock, (GIAC-GWAPT, GIAC-GPEN, GIAC-GCFA, CISA, CISM, CISSP, SSCP, NSA-IAM) has served as Manager of Global Security Operations / Security Architect for one of the worlds largest financial transaction hubs (S1 Corporation) overseeing all aspects of security for hundreds of web-banking environments, ATM networks, and point-of-sale transaction networks world-wide.
Currently on the National Board of Information Security Examiners (NBISE) Operational Security Testing Panel designing comprehensive testing solutions to evaluate skill levels of commercial penetration testers as well as military red team, and blue team technicians. This is primarily to be used by government & military to identify above-average talent in these areas.
Has over ten years of experience providing penetration testing and assessment services to hundreds of clients in the financial, government, retail, chemical, oil & gas, medical, educational, legal, telecom, and law enforcement sectors. See his full bio on the following page.
While in Austin, Trey will be offering an encore presentation of his Pentesting 101 Course/

Kurt Brown (SFBay)

Kurt Brown, leads the Data Platform team at Netflix. His group architects and manages the technical infrastructure underpinning the company’s analytics. The Netflix data infrastructure includes various big data technologies (Hadoop, Hive, and Pig), Netflix open sourced applications and services (Lipstick and Genie), and traditional BI tools (Teradata and MicroStrategy).
Kurt will be speaking on How to get the most out of your data platform.
Following his presentation at Data Day, Kurt will be holding office hours and discussing careers at Netflix.


Laine Campbell (Las Vegas)

Laine Campbell specializes in database architecture and operations, particularly MySQL and Cassandra. Laine is currently CTO of OrderWithMe. Most recently, Laine was a co-founder at Pythian, where she led the open source database practice. Prior to that, Laine founded and led PalominoDB, then Blackbird for 8 years, where her team of DBAs supported many of the most exciting database infrastructures in the industry. Before that, she designed, built and supported the Travelocity databases for 8 years with a remarkable team. She lives in Las Vegas, and travels extensively. Laine has been around the block.
Laine is passionate about supporting members of underserved populations to gain experience, skills and jobs in technology. She is an advocate of bringing women, people of color and LGBTQ individuals into the world of technology, and supporting them in their careers. She is also passionate about open source technologies and the commoditization of IT, and how it can support communities and the general welfare of the individual.
Twitter: @lainevcampbell

Ed Capriolo (NYC)

Ed Capriolo is a Data Architect at the Huffington Post. Previously, he was a software developer at Media 6 degrees. Ed is organizer of the NYC Cassandra Meetup group, as well as a Apache Hive PMC committer / member. Ed is author of multiple books, including the Cassandra High Performance Cookbook and Programming Hive.
While at Data Day Texas, Ed will also be signing the soon to be released second edition of his O'Reilly book, Programming Hive


Michelle Casbon (San Antonio / SFBay )

Michelle Casbon is a Senior Data Science Engineer at Idibon, where she is contributing to the goal of bringing language technologies to all the world’s languages. Her development experience spans a decade across various industries, including media, investment banking, healthcare, retail, and geospatial services. Michelle completed a Masters at the University of Cambridge, focusing on NLP, speech recognition, speech synthesis, and machine translation. She loves working with open source technologies and has had a blast contributing to the Apache Spark project. Holding technical conversations and learning from the people she meets is her favorite part of Data Day Texas.
Michelle will be giving the following presentation at Data Day Texas: Under the Hood of Idibon’s Scalable NLP Services
Michelle Casbon will be appearing as part of NLP Day.

Doug Daniels (NYC )

Doug Daniels is the Director of Engineering at Datadog, where he works on large-scale data systems for metrics monitoring, analytics, and data science. Prior to Datadog, Doug was CTO at Mortar Data and has also designed data systems used by 1 in 10 public school students in the US. He holds a BA in Computer Science from Rice University and spends his time in lovely Brooklyn, NY.
Doug will give the following presentation at Data Day Texas: Parquet at Datadog: Fast, Efficient, Portable Storage for Big Data


Ted Dunning (SFBay)

Ted Dunning is Chief Applications Architect at MapR Technologies and committer and PMC member of the Apache Mahout, Apache ZooKeeper, and Apache Drill projects and mentor for Apache Storm. He contributed to Mahout clustering, classification, and matrix decomposition algorithms and helped expand the new version of Mahout Math library. Ted was the chief architect behind the MusicMatch (now Yahoo Music) and Veoh recommendation systems, built fraud-detection systems for ID Analytics (LifeLock), and has issued 24 patents to date. Ted has a PhD in computing science from University of Sheffield. When he’s not doing data science, he plays guitar and mandolin. Ted is co-author, along with Ellen Friedman, of the recent O'Reilly media publications, Practical Machine Learning: Innovations in Recommendation, and Practical Machine Learning: A New Look at Anomaly Detection. By the way, Ted bought the beer at the first Hadoop meetup.
Twitter: @ted_dunning

Helena Edelson (Portland)

Helena Edelson has been a Software Engineer for over 15 years. After a decade in distributed messaging engineering she moved exclusively to working with Scala in production on big data and cloud infrastructure automation for large scale distributed systems. As a Senior Cloud Engineer she was on the first Scala team at VMware building multi-tenant cloud automation systems, then in big data architecting, building and deploying streaming and batch analytics pipelines for Cyber Security for real time threat analysis. Most recently she has worked on streaming analytics and machine learning at scale with FiloDB, Apache Spark, Cassandra, Kafka, Akka and Scala.
Helena is a committer to FiloDB, the Spark Cassandra Connector and a contributor to Akka, adding new features in Akka Cluster such as the initial version of the cluster metrics API and AdaptiveLoadBalancingRouter. While working at SpringSource she was a contributor to several open source projects such as Spring Integration and Spring AMQP. Helena is a speaker at international Big Data and Scala conferences such as Spark Summit in Europe and the US, QCon, Scala Days, Data Days, and Philly Emerging Technology. She is currently VP of Product Engineering at Tuplejump.
Twitter: @helenaedelson

Chris Fregly (SFBay)

Chris Fregly is Principal Data Solutions Engineer for the IBM Spark Technology Center. Chris is an Apache Spark contributor, the organizer of the Bay Area Advanced Spark Meetup, and author of the upcoming books Advanced Spark and Spark Streaming in Action. Chris has 15+ years of distributed big data systems experience across many domains including media/entertainment, banking, insurance, and travel. Previously, Chris was an engineer at Databricks, Streaming Data Engineer at Netflix, Platform/Data Engineer at Playboy Enterprises, and a Distributed Systems Engineer at BEA Systems.
Twitter: @ cfregly

Ellen Friedman (SFBay)

Ellen Friedman is a solutions consultant, scientist and author, currently writing about a variety of open source and big data topics including being co-author of Mahout in Action (Manning), the Practical Machine Learning series from O’Reilly, and the newest title, Time Series Databases (O’Reilly). She is a committer on the Apache Mahout project, a contributor to Apache Drill and has been an invited speaker at Berlin Buzzwords 2013, the Philly ETE 2014 conference and keynote speaker for NoSQL Matters 2014 in Barcelona. With a Ph.D. in biochemistry and years of work writing on a variety of scientific and computing topics, she is an experienced communicator. She’s also co-author of a book of magic-themed cartoons, A Rabbit Under the Hat.
Twitter: @Ellen_Friedman

Luca Garulli (London, UK)

Luca Garulli is the CEO and Founder of Orient Technologies, and the original author of OrientDB. Luca started working with storage algorithms in 1998 and created the first production-ready version of OrientDB in early 2010 after 17 years of experience working with other DBMSs. Luca is a member of the Sun Microsystems JDO 1.0 and 2.0 Expert Groups that wrote the JDO standard. He has also published various tech articles in Technet, Computer Programming, IoProgrammer, and Week.it magazines.
Twitter: @lgarulli
Luca will be holding office hours at Data Day. He will also be speaking at Graph Day


Nicholas Gaylord (SF Bay)

Nicholas Gaylord is a data scientist at Idibon. Bio forthcoming.

Nicholas Gaylord will be appearing as part of NLP Day.





Jonathan Gray (SF Bay)

Jonathan Gray, founder and CEO of Cask, is an entrepreneur and software engineer with a background in startups, open source, and all things data. Prior to founding Cask, Jonathan was a software engineer at Facebook where he drove HBase engineering efforts, including Facebook Messages and several other large-scale projects, from inception to production.
An open source evangelist, Jonathan was responsible for helping build the Facebook engineering brand through developer outreach and refocusing the open source strategy of the company. Prior to Facebook, Jonathan founded Streamy.com, where he became an early adopter of Hadoop and HBase and is now a core contributor and active committer in the community.
Jonathan holds a Bachelor’s degree in Electrical and Computer Engineering and Business Administration from Carnegie Mellon University.


Joel Grus (Seattle)

Joel Grus is a software engineer at Google. Before that he worked as a data scientist at multiple startups. He lives in Seattle, where he regularly attends data science happy hours.
Personal Site
Twitter: @JoelGrus
While at Data Day Texas, Joel will also be signing his recently released O'Reilly book, Data Science From Scratch


Sarah Guido (NYC)

Sarah Guido is a data scientist at Bitly and is interested in all things Python, data, and machine learning. She is a co-organizer of the PyGotham conference and the NYC Python meetup. Excited to share her passion for data with others, she has spoken at conferences such as PyCon, OSCON and PyData, and is writing an O'Reilly book on machine learning. Prior to joining Bitly, she worked in a few other startups and graduated from the University of Michigan’s School of Information.
Twitter: @sarah_guido
While at Data Day Texas, Sarah will also be signing her new book, Introduction to Machine Learning with Python


Russell Jurney (SF Bay)

Russell Jurney is founder and CEO of Relato. Russell has over a decade of experience building analytic applications, from casino gaming to inbox analytics. Russell is passionate about graphs and sees networks in the world around him. Mapping markets to achieve a deeper understanding of how they work is exciting work.
Prior to Relato, Russell was a Data Scientist in Residence at The Hive, where he helped launch E8 Security as their first engineer. Before that he was Evangelist at Hortonworks, after being Senior Data Scientist in product analytics at LinkedIn. Russell is author the recently released O'Reilly book Agile Data Science as well co-author of the the soon to be released O'Reilly book: Big Data for Chimps. Russell is originally from Atlanta, GA. He lives in Pacifica, California with Bella the Data Dog.
Data Syndrome
Twitter: @rjurney

While at Data Day, Russell will be holding office hours and signing copies of Big Data for Chimps. He will also be speaking at Graph Day on Sunday, January 18.


Jason Kessler (Seattle)

Jason Kessler is a data scientist at CDK Digital Marketing, where he analyses language use and consumer behavior in the online auto-shopping ecosystem. Prior to joining CDK, Jason was the founding data scientist at PlaceIQ, and worked as a research scientist for JD Power and Associates Web Intelligence Division. He has published peer-reviewed papers on systems and corpora for sentiment and belief analysis, and has reviewed and sat on program committees for ICWSM, ACL Demos, RANLP, CICLing, and others venues.



Jay Kreps (SF Bay)

Jay Kreps is the original author of multiple well-known projects including Apache Kafka, Apache Samza, Voldemort, and Azkaban. Formerly Principle Staff Engineer at Linkedin. Jay is also co-founder and CEO at Confluent - a company built around realtime data streams and the open source messaging system Apache Kafka. Jay is the author of the upcoming O'Reilly book, I Heart Logs: Event Data, Stream Processing, and Data Integration.
Twitter: @jaykreps
While at Data Day Texas, Jay will also be signing his new book, I Heart Logs

Homin Lee (NYC)

Homin Lee is a data scientist for Datadog, where he writes algorithms that process hundreds of billions data points a day. Prior to Datadog, Homin built large-scale machine learning systems at several start-ups. Homin has a PhD from Columbia University in computational learning theory, and was a Computing Innovation Fellow at the University of Texas at Austin.




Charity Majors (San Francisco)

Charity Majors is a systems geek and scalability nerd. Most recently, Charity was the Production Engineering Manager at Parse/Facebook - where she managed of the biggest and most challenging MongoDB deployments in the world, with dynamic indexing and other fun goodies. Also in the mix at Parse were mysql, Cassandra, Redis, Hive, and others. Wee.
Before joining Parse she built systems for companies like Linden Lab, Shopkick and Cloudmark, and developed a passion for resilient and self-healing architecture. She reads a lot of economics and drinks a lot of whiskey. Charity went to school on a piano performance scholarship and studied philosophy, classical studies and music composition before dropping out to come play in the dotcom games. Charity's Data Day 2014 talk was one of the most frequently mentioned and highly rated.
Twitter: @mipsytipsy

Taylor Martin (SF Bay)

Taylor Martin is a pioneer in bringing Big Data to education. She is among the founders of the learning analytics movement, which seeks to use data from online games and learning experiences to better personalize instruction, particularly in mathematics.
At the National Science Foundation, Taylor manages a portfolio of awards and the evaluation of new proposals in a variety of programs, including BIGDATA, Building Community and Capacity (BCC), DRK-12, STEM+C, Cyberlearning, and EHR Core Research. Key themes for the projects in her portfolio are using data science to answer critical questions in STEM Education research, realizing the potential that technology brings for improving STEM education for all students, and exploring novel technologies that could revolutionize that potential. Taylor also works on a variety of efforts across the foundation to understand how Big Data is impacting research in Education and across the STEM disciplines.

Patrick McFadin (SF Bay)

Patrick McFadin is regarded as one of the foremost experts of Apache Cassandra and data modeling techniques. As the Chief Evangelist for Apache Cassandra and consultant for DataStax, he has helped build some of the largest deployments in the world. Previous to DataStax, he was Chief Architect at Hobsons, an education services company. There, he spoke often on web application design and performance.
Twitter: @patrickmcfadin

Wes McKinney (SF Bay)

Wes McKinney is a software engineer at Cloudera. Prior to that, Wes was co-founder of DataPad, and CTO and Cofounder of Lambda Foundry, Inc. From 2010 to 2012, he served as a Python consultant to hedge funds and banks while developing pandas, a widely used Python data analysis library. From 2007 to 2010, he researched global macro and credit trading strategies at AQR Capital Management. He graduated from MIT with an S.B. in Mathematics. He is on leave from the Duke University Ph.D program in Statistics. Wes is author of the O'Reilly book Python for Data Analysis.
Wes McKinney's Blog
Twitter: @wesmckinn

Mark Mims (SF Bay)

Mark Mims is a Principal Engineer at Silicon Valley Data Science and his passion is Data Plumbing, where Data Science meets the real world of DevOps and Infrastructure Engineering. Mark has extensive experience architecting and implementing data science solutions across a variety of industries including Entertainment, Insurance, Finance, Energy, Education, Manufacturing, and Commercial Modeling and Simulation. Before joining SVDS, Mark was the Principal Data Architect for Infochimps/CSC building managed "Big Data" pipelines for CSC's Enterprise customer-base. There, he used his deep full-stack datascience infrastructure expertise to adapt the cloud-based Infochimps product line to Openstack-based dedicated rack customer deployments. Previously, He worked for Canonical building DevOps tools for Ubuntu Server to make sure Ubuntu Server meets the needs of Data Plumbers everywhere. Mark has a doctorate in Mathematical Physics from UT Austin for research simulating quantum algorithms and is very interested in what it takes to train data scientists.
While at Data Day Texas, Mark will be holding office hours with Silicon Valley Data Science.

Ryan Mitchell (Somerville, MA)

Ryan Mitchell is a Software Engineer at LinkeDrive in Boston, where she develops their API and data analysis tools. She is a graduate of Olin College of Engineering, and is a Masters degree student at Harvard University School of Extension Studies. Prior to joining LinkeDrive, she was a Software Engineer working on web scraping and data analysis at Abine. Ryan is author of Instant Web Scraping with Java and the upcoming O'Reilly book: Web Scraping with Python.
Ryan's webpage
Twitter: @ Kludgist
While at Data Day Texas, Ryan will also be signing her new book, Introduction to Machine Learning with Python

Christopher Moody (SF Bay)

Chris Moody loves high-performance computing, high dimensions & high fashion. He loves learning the beautiful symmetries between physics, data, and analytics. Went to Caltech, did astrostats & supercomputing and now Data Labs at Stitch Fix. Currently enjoying coding up word2vec, Gaussian Processes, Deep RNNs and t-SNE.
Jason Baldridge will be appearing as part of NLP Day.


Robert Munro (San Francisco)

Robert Munro is the CEO of Idibon, founded with the goal of bringing language technologies to all the world’s languages. He is a world leader in applying big data analytics to human communications, having worked in many diverse environments, from Sierra Leone, Haiti and the Amazon to London, Sydney and San Francisco. He completed a PhD in Computational Linguistics as a Graduate Fellow at Stanford University. Outside of work, he has learned about the world’s diversity by cycling more than 20,000 kilometers across 20 countries, mostly through the mountains.
Rob's webpage
Twitter: @WWRob
Rob Munro will be appearing as part of NLP Day.

Diego Oppenheimer (Seattle)

Diego Oppenheimer is a data geek with a passion for sports and cooking. He has worked in multiple industries in different capacities around business intelligence and data analytics.Prior to founding Algorithmia where he serves the role of CEO he spent over five years at Microsoft where he had the chance to deliver some of the most widely used data analysis software in the world including Excel, SQL Server and Power Pivot. He received his Bachelors in Information Systems Management and Masters degree in IS - Business Intelligence and Data Analytics from Carnegie Mellon University.
Twitter: @doppenhe
Diego will also be hosting office hours at Data Day Texas.

Stephen O'Sullivan (SF Bay)

Stephen O'Sullivan is the VP of Engineering at Silicon Valley Data Science, where he leads data architecture and infrastructure. A veteran of WalmartLabs, Sun and Yahoo! with over 20 years of experience creating scalable, high-availability, data and applications solutions, Stephen is leading expert on big data architecture and Hadoop.
Stephen will also be hosting office hours at Data Day Texas.


Claudia Perlich (NYC)

Prior to joining Dstillery (former Media6Degrees), Claudia Perlich spent five years working at the Data Analytics Research group at the IBM T.J. Watson Research Center, concentrating on research in data analytics and machine learning for complex real-world domains and applications. She has been published in over 30 scientific publications and holds multiple patents in the area of machine learning. Claudia has won many data mining competitions, including the prestigious 2007 KDD CUP on movie ratings, the 2008 KDD CUP on breast-cancer detection, and the 2009 KDD CUP on churn and propensity predictions for telecommunication customers. Claudia received her Ph.D. in Information Systems from Stern School of Business, New York University in 2005, and holds a Master of Computer Science from Colorado University.
Twitter: @claudia_perlich
Claudia will also be hosting office hours at Data Day Texas.

Fintan Quill (NYC)

Fintan Quill is the global head of sales engineering for Kx Systems. An expert in developing database analytic systems, Fintan joined Kx in 2012 after having worked extensively with quantitative teams at a variety of Wall Street investment banks, hedge funds, and trading shops building high-performance Big Data applications. After beginning his career with First Derivatives, a global financial technology consultancy based in Northern Ireland, he moved to the U.S., where he worked at Nomura, Barclays Capital, and other investment banks. Fintan is a graduate of Trinity College in Dublin with a specialization in Computing and Microelectronic Engineering.



Ben Reiter (SF Bay)

Ben Reiter is a Senior Engineer on the Architecture team on Vungle. His responsibilities include planning and implementing a new shared data pipeline throughout Vungle. His focus for the past year has been on designing and implementing the way data flows through and is processed within Vungle's architecture.
Ben will be walking through their Spark use case at Vungle.



Eric Sammer (San Francisco)

Eric Sammer , Co-Founder and CTO at Rocana, is deeply entrenched in the open source community with a passion for solving difficult scaling and processing problems. Prior to Rocana, Eric most recently served as an Engineering Manager at Cloudera, responsible for developer tools and partner integrations. Eric’s team worked with hundreds of partners to develop robust solutions and integrate them tightly with Cloudera’s Enterprise Data Hub. He was previously a Principal Solutions Architect, working with customers and strategic partners to support and integrate Hadoop clusters and related infrastructure. While working with some of Cloudera’s largest customers, Eric developed many of the best practices for developing large, distributed, data processing infrastructure.
Eric is a committer on the Apache Flume and Apache MRUnit projects, and the creator of the Kite open source project. Prior to Cloudera, Eric served as a Senior Engineer and Architect at several large scale data driven organizations including Experian and Conductor. Eric is author of Hadoop Operations, published by O'Reilly. He speaks frequently on technology and techniques for large scale data processing, integration, and system management.
Twitter: @esammer

Eric Schmidt (Seattle)

Eric Schmidt is the Product Management lead for Cloud Dataflow at Google. His primary role is to help shape the future of fully managed - large scale data processing. Schmidt spends the majority of his time working with existing cloud customers as well as on premise developers who are moving their MapReduce and related data processing workloads to the cloud. Beyond his Google Cloud focus, he has a deep passion for user interaction modeling, data modeling & analytical processing of user behaviors and development experience with .NET, C, JavaScript, Python, and Java.
Twitter: @DJ Rhythma
Eric will be giving the following talk at Data Day Texas: Google Cloud Dataflow - Two Worlds Become A Much Better One



Brent Schneeman (Austin)

Brent Schneeman joined HomeAway in 2010 and focuses on strengthening the data science muscle in the Technology Office. As Director of Data Science, he serves as an internal consultant on a diverse set of analytic projects such as multi-variate testing, customer website behavior and applying natural language processing techniques to unstructured data. A storyteller, Brent has presented at South By Southwest and has given many technological talks. Prior to joining HomeAway, Brent worked at PayPal and Visa. He has one degree in Mathematics and another in Electrical Engineering and lives in Austin Texas with his wife and three kids and spends most of his free time mowing the lawn.
Brent Schneeman will be appearing as part of NLP Day.

Fangjin Yang (SF Bay)

Fangjin Yang is one of the main committers to the open source Druid project and one of the first developers at Metamarkets, a San Francisco-based data startup. Fangjin previously worked on diagnostic optimization algorithms at Cisco Systems. He holds a BASc in Electrical Engineering and a MASc in Computer Engineering from the University of Waterloo, Canada.