The following speakers are confirmed for Data Day Texas 2017. We will be announcing the final 25 speakers over the next few weeks. Don't wait to get your ticket. Take advantage of the advance registration discount.
A list of the NLP Day Speakers can be found here.
A list of the Graph Day Speakers can be found here.
KEYNOTE - Emil Eifrem (SF Bay) @emileifrem
Emil Eifrem (Linkedin) sketched what today is known as the property graph model on a flight to Mumbai in 2000. This sketch grew into the Neo4j project (wikipedia) which, since its initial release in 2007, has been the most known and most widely implemented graph database. Emil is not only co-founder of the Neo4j project, but also co-founder and CEO of Neo Technology - a global organization with offices in San Mateo, Sweden, UK, Germany, and France. It is safe to say that there is no greater evangelist for the graph database space than Emil, and Data Day is honored to have him give the keynote for Data Data TX 2017 - the year of the graph.
(NEW) Paul Dix (NYC)
Paul Dix has been a fixture on the NYC data scene since before people were throwing around the phrase "big data". Paul is organizer of the NYC Machine Learning meetup, which for many years has been the largest group of its kind in the world. Paul is also editor of the Addison Wesley Data and Analytics Series
Most recently, Paul founded InfluxDB, a company which produces an open-source, distributed, time series database with no external dependencies.
Robert Munro (San Francisco) @WWRob
Robert Munro (Linkedin), Principal Product Manager for Machine Learning at Amazon Web Services, is a computational linguist and data scientist working at the leading edge of scalable language technologies. Prior to joining Amazon, Robert founded Idibon, which brought together one of the strongest teams in machine learning, shipping products that combined automation and human feedback to industry leaders across multiple markets & locations.
Robert, a world leader in applying machine learning, natural language processing, crowdsourcing and big data analytics to human communication, has worked in many diverse environments, from Sierra Leone, Haiti and the Amazon to London, Sydney and San Francisco, with organizations ranging from Silicon Valley Startups to the United Nations.
John Akred is the Founder and CTO of Silicon Valley Data Science. In the business world, John Akred likes to help organizations become more data driven. He has over 15 years of experience in machine learning, predictive modeling, and analytical system architecture. His focus is on the intersection of data science tools and techniques; data transport, processing and storage technologies; and the data management strategy and practices that can unlock data driven capabilities for an organization. A frequent speaker at the O'Reilly Strata Conferences, John is host of the perennially popular workshop: Building A Data Platform.
John will also be hosting office hours at Data Day Texas.
Laine Campbell specializes in database architecture and operations, particularly MySQL and Cassandra. Most recently, Laine was the CTO at OrderWithMe. Prior, she was a co-founder at Pythian, where she led the open source database practice. Laine founded and led PalominoDB, then Blackbird for 8 years, where her team of DBAs supported many of the most exciting database infrastructures in the industry. Before that, she designed, built and supported the Travelocity databases for 8 years with her remarkable team. Laine has also supported such organizations as Obama for America, Zappos, Chegg, LiveJournal, Disney Mobile, and Adobe.
While at Data Day, Laine will be holding office hours and signing copies of her upcoming O'Reilly book: Database Reliability Engineering.
Joey Echeverria (SF Bay) @fwiffo
Joey Echeverria is the platform technical lead at Rocana, where he builds applications for scaling IT operations built on the Apache Hadoop platform. Joey is a committer on the Kite SDK, an Apache-licensed data API for the Hadoop ecosystem. Joey was previously a software engineer at Cloudera, where contributed to several ASF projects including Apache Flume, Apache Sqoop, Apache Hadoop, and Apache HBase. Joey is also a coauthor of Hadoop Security, published by O'Reilly Media.
Nicholas Gaylord (SF Bay) @texastacos
Nicholas Gaylord is Senior Data Scientist at CrowdFlower, where he helps build out their new machine learning offering, CrowdFlower AI. CrowdFlower AI allows data scientists to construct, monitor, and improve machine learning models using data collected at scale from human contributors via the CrowdFlower platform, in a tightly integrated human-in-the-loop active learning environment. Prior to CrowdFlower, Nick was a data scientist at SF text analytics startup Idibon. He has a PhD from the University of Texas at Austin, where his research focused on human language comprehension and the construction of datasets for NLP applications. In his spare time he fixes bikes and collaborates on work applying cognitive science principles to the public health domain.
Nicholas Gaylord will be appearing as part of NLP Day Texas.
(NEW) Juliet Hougland (SF Bay) @JulietHougland
Juliet Hougland is a data scientist at Cloudera, and contributor/committer/maintainer for the Sparkling Pandas project. Her commercial applications of data science include developing predictive maintenance models for oil and gas pipelines at Deep Signal, and designing/building a platform for real-time model application, data storage, and model building at WibiData. Juliet was the technical editor for Learning Spark by Karau et al. and Advanced Analytics with Spark by Ryza et al. She holds an M.S. in applied mathematics from the University of Colorado, Boulder and graduated Phi Beta Kappa from Reed College with a BA in math-physics..
Juliet will be holding the following session: How to Observe: Lessons from Epidemiologists, Actuaries and Charlatans.
Holden Karau (San Francisco) @holdenkarau
Holden Karau is a software development engineer and is active in open source. She a co-author of Learning Spark & Fast Data Processing with Spark and has taught intro Spark workshops. Prior to IBM she worked on a variety of big data, search, and classification problems at Alpine, DataBricks, Google, Foursquare, and Amazon. She graduated from the University of Waterloo with a Bachelors of Mathematics in Computer Science. Outside of computers she enjoys dancing & playing with fire.
Check out the recent Global Data Geeks interview with Holden Karau.
While at Data Day, Holden will be holding office hours and signing copies of her O'Reilly book: High Performance Spark.
Alex Korbonits (Seattle) @korbonits
Alex Korbonits is a Data Scientist at Remitly, Inc., where he works extensively on feature extraction and putting machine learning models into production. Outside of work, he loves Kaggle competitions, is diving deep into topological data analysis, and is exploring machine learning on GPUs. Alex is a graduate of the University of Chicago with degrees in Mathematics and Economics.
Alex Korbonit's session: Distilling dark knowledge from neural networks.
(NEW) - Leland Lockhart (Austin)
Leland Lockhart, Ph.D. serves as chief data officer for XOR Data Exchange, an Austin-based startup developing new approaches to fraud and risk mitigation. In his role, Lockhart is responsible for oversight of data analytics across all products and research. Prior to his role at XOR, he served in various senior strategic research and data science roles for national leaders in the finance and consumer product industries.
Lockhart’s areas of technical expertise include credit and fraud risk modeling, statistical mediation analysis, machine learning, statistical programming, inferential statistics, psychometrics, behavioral analytics, unstructured data and big data technologies.
Dr. Taylor Martin's mission in life is understanding how people learn. She's particularly interested in how adaptive and personalized learning can best be used to help people reach their learning goals faster.
As an established academic and thought-leader in the Learning Sciences, Dr. Martin has spearheaded data-centric approaches to developing learning environments and then measuring how people learn Science, Math, Engineering and Computer Science in these environments. This includes environments such as online games, online programming environments (e.g., scratch.mit.edu), internship programs, Maker spaces, and engineering design labs.
In her current role as Principal Learning Scientist at O’Reilly Media, she's focused on implementation. She's helping a team of data scientists and engineers mix in just the right amount of data-driven "learning engineering" to personalize the learning experience across various forms of published media.
Check out our interview with Taylor Martin.
(NEW) - Mark Mims (SF Bay) @m_3
Mark Mims is a Principal Engineer at Silicon Valley Data Science and his passion is Data Plumbing, where Data Science meets the real world of DevOps and Infrastructure Engineering. Mark has extensive experience architecting and implementing data science solutions across a variety of industries including Entertainment, Insurance, Finance, Energy, Education, Manufacturing, and Commercial Modeling and Simulation. Before joining SVDS, Mark was the Principal Data Architect for Infochimps/CSC building managed "Big Data" pipelines for CSC's Enterprise customer-base. There, he used his deep full-stack datascience infrastructure expertise to adapt the cloud-based Infochimps product line to Openstack-based dedicated rack customer deployments. Previously, He worked for Canonical building DevOps tools for Ubuntu Server to make sure Ubuntu Server meets the needs of Data Plumbers everywhere. Mark has a doctorate in Mathematical Physics from UT Austin for research simulating quantum algorithms and is very interested in what it takes to train data scientists.
While at Data Day Texas, Mark will be holding office hours with Silicon Valley Data Science.
(NEW) - Patrick McFadin (SF Bay)
Patrick McFadin is regarded as one of the foremost experts of Apache Cassandra and data modeling techniques. As the Chief Evangelist for Apache Cassandra and consultant for DataStax, he has helped build some of the largest deployments in the world. Previous to DataStax, he was Chief Architect at Hobsons, an education services company. There, he spoke often on web application design and performance.
Ryan Mitchell (Somerville, MA) @Kludgist
Ryan Mitchell (Linkedin) is a senior software engineer at HedgeServ , She received her master's in software engineering from Harvard University, Extension School, and a bachelor's in Engineering at Olin College of Engineering. Prior to joining HedgeServ, Ryan was a Software Engineer building web scrapers and bots at Abine Inc. Ryan is the author of two books about web scraping: Web Scraping with Python (O’Reilly, 2015), and Instant Instant Web Scraping with Java (Packt, 2013), as well as an upcoming O’Reilly video series: Web Crawling with Python.
In addition to speaking at past Data Day events in both Seattle and Austin, Ryan gives talks and runs workshops around the country, including an upcoming 8 week web development course through the Boston Public Library this fall.
Jonathon Morgan (Linkedin) is Founder and CEO at New Knowledge. a company building technologies to understand and predict human behavior. As part of his ongoing work applying quantitative methods to combating violent extremism, he served as an advisor to the White House and State Department, co-authored the ISIS Twitter Census for the Brookings Institution, and develops new technology with DARPA. Jonathon is also the co-host of Partially Derivative, an unrealistically popular podcast about data science and drinking.
Jonathon will be giving the following presentation: Decision Boundary 2016: Margin of Terror (aka Data Scientists are Bad at Politics)
(NEW) - Stephen O'Sullivan (SF Bay) @steveos
Stephen O'Sullivan is the VP of Engineering at Silicon Valley Data Science, where he leads data architecture and infrastructure. A veteran of WalmartLabs, Sun and Yahoo! with over 20 years of experience creating scalable, high-availability, data and applications solutions, Stephen is leading expert on big data architecture and Hadoop.
Stephen will also be hosting office hours at Data Day Texas.
(NEW) - Nelson Ray (SF Bay)
Nelson Ray manages the Risk Science group at Opendoor in San Francisco. His team is responsible for pricing the fee for Opendoor's home buying service and for optimizing resale strategy using a variety of machine learning models and experimental techniques. Prior to joining Opendoor, Nelson was a data scientist at Google and a software engineer at Metamarkets. He holds a BS in mathematics and an MS and PhD in statistics from Stanford University.
Nelson will be giving the following presentation: When A/B Testing Fails: A Case Study in Real Estate
Melissa Santos (Portland) @ansate
Melissa Santos has over a decade of experience working with data, from ETLs and reporting to Hadoop clusters and marketing analytics. In her previous role as Engineering Manager of Etsy, she led her team from being a Hadoop Infrastructure team that was constantly fixing problems and cleaning up messes, to declaring themselves to be a Data Platform team, expanding into investigating new tools, teaching coworkers about big data, and consulting with other teams about how to meet their data needs. Favorite past projects include implementing a beta-binomial model in SAS, creating neighborhood boundaries from Flickr and OpenStreetMap data, using principal components analysis to detect spam emails, and teaching coworkers to write Scalding jobs. Melissa's professional goal is to make data more accessible to all parts of the business, and to businesses of every size. She has a PhD in Applied Math and is currently the (sole) Data Scientist for Big Cartel.
Melissa will be giving the following presentation: Distances, Similarities, and Scores: Practical Model Examples
Julie Steele (NYC)
Julie Steele , Julie thinks in metaphors and finds beauty in the clear communication of ideas. She is particularly drawn to visual media as a way to understand and transmit information, and is co-author of Beautiful Visualization (O’Reilly 2010) and Designing Data Visualizations (O’Reilly 2012).
Eric Tschetter (San Francisco)@zedruid
Eric Tschetter Eric Tschetter started the Druid project, an open source, real-time analytical data store. Eric currently works as a distinguished engineer at Yahoo, where he endeavors to speed up analytics with a mix of data science and traditional BI. Eric previously worked with diabetes data at Tidepool, a nonprofit, was the VP of engineering and lead architect at Metamarkets, and has held senior engineering positions at Ning and LinkedIn. He holds bachelor’s degrees in computer science and Japanese from the University of Texas at Austin and an MS in computer science from the University of Tokyo..
Dean Wampler (Chicago)
Dean Wampler, is a software developer, new data scientist, technical author, and frequent public speaker living in Chicago. Dean is the author of Functional Programming for Java Developers, the co-author (with Alex Payne) of Programming Scala, and the co-author (with Edward Capriolo and Jason Rutherglen) of Programming Hive, all published by O'Reilly Media. Dean is a frequent speaker at conferences and user groups. Many of his presentations can be found at his Polyglot Programming site. Dean also helps organize several conferences as well as started the Chicago-Area Scala Enthusiasts user group.
Website / Blog
Michelle Casbon (San Antonio) @texasmichelle
Michelle Casbon is Director of Data Science at Qordoba. Previously, she was a Senior Data Science Engineer at Idibon, where she contributed to the goal of bringing language technologies to all the world’s languages. Michelle's development experience spans a decade across various industries, including media, investment banking, healthcare, retail, and geospatial services. Michelle completed a Masters at the University of Cambridge, focusing on NLP, speech recognition, speech synthesis, and machine translation. She loves working with open source technologies and has had a blast contributing to the Apache Spark project. Holding technical conversations and learning from the people she meets is her favorite part of Data Day.
Michelle will be giving the following presentation: Untangling the Ball of Strings: Machine Learning for Localization
Check out our interview with Michelle Casbon.
NLP Day Speakers
(NEW) - Sanghamitra Deb (SF Bay) @sangha_deb
Sanghamitra Deb is a Data Scientist at Accenture Technology Laboratory. As a data scientist at a Accenture she has worked on a wide variety of problems related data modeling, architecture and visual story telling. She has also worked in multiple data roles in different projects. Her primary focus is application of Natural Language Processing and Machine Learning to enterprise data. She is active in Data Science outreach and believes in applying analytics to a range of domains such as pharma, HR, customer support, market research, etc. Prior to being data scientist she was an astrophysicist who studied the structure of the universe by modeling galaxy clusters.
Sanghamitra will be holding the following session: Creating Knowledgebases from text in absence of training data.
Jason Kessler is a data scientist at CDK Global, where he analyses language use and consumer behavior in the online auto-shopping ecosystem. Prior to joining CDK, Jason was the founding data scientist at PlaceIQ and worked as a research scientist for JD Power and Associates. He has published peer-reviewed papers on algorithms and corpora for sentiment and belief analysis, and has sat on program committees and reviewed for several AI and NLP conferences. Most recently, he has delivered talks on the identification of persuasive and influential language language to the 2015 Sentiment Symposium and Data Day Seattle 2016.
Stefan Krawczyk (San Francisco) @stefkrawczyk
Stefan Krawczyk loves the stimulus of working at the intersection of design, engineering, and data. He spent formative years at Stanford, LinkedIn, Nextdoor & Idibon, working on everything from growth engineering, product engineering, data engineering, to recommendation systems, NLP, data science and business intelligence. At Stitch Fix he’s leading development of the algorithm development platform.
Stefan Krawczyk will be appearing as part of NLP Day Texas.
Rob McDaniel (Seattle)
Rob McDaniel is the co-founder of Lingistic -- a Seattle-based machine learning startup currently exploring the semantic modeling and analysis of political text. Lingistic's first product was a text classifier designed to identify political bias in news articles, which the company is currently expanding into a public service. He first fell in love with natural language processing and machine learning while building machine translation software for mobile phones, and has since focused his career on unsupervised learning problems, natural language processing, and the extraction of taxonomies and semantics from unstructured text.
Gabor Melli (San Francisco) @gmelli
Gabor Melli is the Director of Data Science at OpenGov where he leads their initiatives to automate knowledge-intensive text-rich processes. This work largely involves the training of predictive models for classification, sequence labeling, and estimation for tasks such as named entity recognition and disambiguation in user generated text using techniques and tools such as: CRFs, SVMs, HMMs, Logistic, LDA, NLTK, Spark, Python, R, and AWS' EC2/S3/EMR. He has led and delivered large-scale data-driven initiatives at organizations ranging from Microsoft, AT&T, T-Mobile, ICBC, Washington Mutual, and Wal*Mart to start-ups such as Datasage, Meals.com, PredictionWorks, VigLink and now at OpenGov.
Gabor holds a PhD in Computing Science from Simon Fraser University in the topic of document to ontology interlinking. He has been active in the data science community for over twenty years and is the recipient ACM SIGKDD's Service Award in 2013. His current research interest include iterative semantic semi-supervised text analysis and automated business process optimization.
Gabor Melli will be appearing as part of NLP Day Texas.
Jonathan Mugan (Austin) @jmugan
Jonathan Mugan (Linkedin) is Co-Founder and CEO at DeepGrammar. Dr. Mugan specializes in artificial intelligence and machine learning. His current research focuses in the area of deep learning, where he seeks to allow computers to acquire abstract representations that enable them to capture subtleties of meaning. Dr. Mugan received his Ph.D. in Computer Science from the University of Texas at Austin. His thesis was centered in developmental robotics, which is an area of research that seeks to understand how robots can learn about the world in the same way that human children do. Dr. Mugan also held a post-doctoral position at Carnegie Mellon University, where he worked at the intersection of machine learning and human-computer interaction. He is also the author of The Curiosity Cycle: Preparing Your Child for the Ongoing Technological Explosion.
Jonathan Mugan will be appearing as part of NLP Day Texas.
Jana Thompson (San Francisco)
Jana Thompson is an R&D Technology Associate Principal at Accenture. Prior to joining Accenture, Jana was an NLP Engineer at Idibon, where she developed the core technologies to build custom AI models for their customers’ problems. She has worked in artificial intelligence, speech recognition, and field linguistics, and has an MA in Germanic Studies, BS in mathematics, and BA in anthropology from the University of Texas at Austin. Jana frequently forages for edible plants in Golden Gate Park with her daughter in her quest to perfect her local-to-table dining experience.
Jana Thompson will be appearing as part of NLP Day Texas.
(NEW) Jacob Su Wang (Austin)
Jacob Su Wang works as a data scientist at OJO Labs. Inc., an Austin-based artificial intelligence startup, and is currently a second-year graduate student at the Department of Linguistics at the University of Texas at Austin, where he specializes in Computational Linguistics. Jacob now serves as a research assistant for Dr. Katrin Erk at UT, working on distributional semantics and Bayesian Hierarchical Models, with which he explores how humans can learn and use words appropriately with very little experiential exposure.
Before working at OJO, Jacob studied general linguistics (CS minor) and applied linguistics at the University at Buffalo (SUNY) and Yunnan University (China), before graduating with two M.A.s in linguistics. He also holds a B.S. in Informatics.
Jacob will be holding the following session: Exploring Question-Answering System: Named Entity Recognition & Sentence Similarity Measure in Practice.
Graph Day Speakers
Graph Day will be held on the first floor of the conference facility, concurrent with Data Day Texas. Your Data Day Texas ticket gets you into all of the Graph Day sessions. Below is a list of the Graph Day speakers confirmed so far. For a list of confirmed sessions, visit the Graph Day Sessions page.
(NEW) - Dave Bechberger (Houston)
Dave Bechberger is an Architect at Expero, a custom software development company building innovative solutions for domain experts across a variety of industries, from geophysicists to supply chain planners. He has spent 2016 building graph solutions for a variety of customers, and come to know the good, the bad, and the ugly of this technology platform.
Dave Bechberger will present the following talk: Moving Your Data To Graph.
Ryan Boyd (SF Bay)
Ryan Body (Linkedin) is a SF-based software engineer focused on helping developers understand the power of graph databases. Previously he was a product manager for architectural software, built applications and web hosting environments for higher education, and worked in developer relations for twenty products during his 8 years at Google. He enjoys cycling, sailing, skydiving, and many other adventures when not in front of his computer.
Arnaud De Moissac (Paris)
Arnaud De Moissac (Linkedin) is co-founder of DCbrain, an enterprise scale IoT startup, focused on delivering real time intelligence to multi physical network (energy, cooling, ...) by modeling flows. Arnaud has 10 years of experience in telco and IT networks, and hold several patents. He also worked during 5 years in the energy efficiency area. He holds two masters degree, in electrical engineering and IT architecture
Arnaud De Moissacr will be appearing as part of Graph Day 2017.
(NEW) - Alex Dimakis (Austin) @alexb80
Alex Dimakis (linkedin) is an Associate Professor in the Dept. of Electrical and Computer Engineering at the University of Texas. He is also a member of the Wireless Networking and Communications Group and the Computer Science Graduate Studies Committee. Alex's interests include information theory, coding theory, and machine learning.
Alex's recent publications include: Beyond Triangles: A Distributed Framework for Estimating 3-profiles of Large Graphs and Batch Codes through Dense Graphs with High Girth. For a list of publications, view Alex's homepage at UT.
Alex Dimakis will be appearing as part of Graph Day 2017.
Sebastian Good is a creative software architect with a focus on turning industry innovation and intellectual property into new products. He leads Expero's architecture and development practice, recruiting switched-on developers, building inventive prototypes, and shipping code for customers.
Sebastian Good will present the following talk: Time Series and Audit Trails: Modeling Time in an Industrial Equipment Property Graph.
Dr. Denise Koessler Gosnell (Charleston) @DeniseKGosnell
Dr. Denise Gosnell, a driving member of the PokitDok Data Science team since 2014, has brought her research in applied graph theory to help architect the graph database while also serving as an analytics thought leader. Her work with the Data Science team aims to extract insight from the trenches of hidden data in healthcare and build products to bring the industry into the 21st century. She has represented PokitDok's Data Science Team at numerous conferences including, PyData, KDD (Knowledge Discovery & Data Mining) and the inaugural GraphDay.
Prior to PokitDok, Dr. Gosnell earned her Ph.D. in Computer Science from the University of Tennessee. Her research on how our online interactions leave behind unique identifiers that form a “social fingerprint” led to presentations at major conferences from San Diego to London and drew the interest of such tech industry giants as Microsoft Research and Apple. Additionally, she was a leader in addressing the underrepresentation of women in her field and founded a branch of Sheryl Sandberg's Lean In Circles.
Denise Gosnell will present the following talk: Graphs vs. Tables: Ready, Fight!..
(NEW) - Borislav Iordanov (Hollywood, Florida) @ bolerio
Borislav Iordanov Borislav Iordanov is currently the VP of Engineering at Grakn Labs, an open source knowledge graph. He is also an entrepreneur and independent researcher. Under his leadership, he has lead a number of a commercial and open-source projects, including founding the innovative, one of a kind NoSQL database HyperGraphDB. Over a period of an 8 year involvement in e-government at Miami-Dade County, several of his initiatives led to nationwide recognition: Best Integrator from the Center for Digital Government for semantic publishing and search; Florida Sterling Innovations Award for PKBI, a content management tool blending text with a formal ontology for contextualized multi-channel delivery; NACo Achievement Award from the National Association of Counties for the Economic Service Bot, an autonomous virtual agent for self-servicing of new local small businesses; finally, the ontology-driven OpenCiRM platform powering the Miami-Dade 311 call center, architected by Mr. Iordanov and implemented under his lead, achieved recognition as semi-finalist in the 2015 Innovations in American Governments Awards Program from the Harvard Kennedy School Ash Center.
Borislav Iordanov will be presenting the following session: Large Scale Graph Analytics Through Graql.
Chris LaCava has spent the past two decades defining, designing and building software for a variety of industry verticals. He has experience as a usability engineer, interaction designer, front-end developer as well as product manager for both consulting and product-oriented organizations. Chris leads Expero's efforts in defining visualization for graph datasets.
Chris LaCava will present the following talk: Meaningful User Experience with Graph Data.
Corey Lanum, has a distinguished background in graph visualization. Over the last 15 years he has managed technical and business relationships with dozens of the largest defense and intelligence agencies in North America, in addition to working with many security and anti-fraud organizations in private industry. Prior to joining Cambridge Intelligence as their US Manager, Corey was helping the customers of i2 (now IBM) and SS8 to solve their most complex graph data challenges. Corey is the author the Learning Graph Visualization from Manning Publications.
Corey will present the following talk: Graphs in time and space: A visual example/a>.
William Lyon (SFBay) @lyonwj
William Lyon is a software developer at Neo4j, the open source graph database. As an engineer on the Developer Relations team, he works primarily on integrating Neo4j with other technologies, building demo apps, helping other developers build applications with Neo4j, and writing documentation. Prior to joining Neo, William worked as a software developer for several startups in the real estate software, quantitative finance, and predictive API fields. William holds a Masters degree in Computer Science from the University of Montana. You can find him online at lyonwj.com
William will be presenting the following session: Neo4j Graph Database Workshop For The Data Scientist Using Python. (90 minutes).
(NEW) - Alaa Mahmoud (Boston)
Alaa Mahmoud is a full-stack software developer with more than 25 years of experience, about 20 of those years are with IBM. He started his career focusing on software i18n. He moved on to work on various technologies such as web development, e-commerce and Customer Analytics software. Currently, Alaa is the dev lead for a team that's putting a Tinkerpop 3 based database on the cloud (IBM Graph). Alaa is also a master inventor with several granted patents. Check out Alaa's recent interview on Linux.com
Alaa will be presenting the following session: Building a Graph Database in the Cloud: challenges and advantages.
(NEW) - David Mizell (Austin)
David Mizell is a technical project lead at Cray, Inc. He started out his career doing parallel computing research, primarily at the Information Sciences Institute in Los Angeles. When the funding for that dried up, he moved to Seattle and Boeing. There he worked on Augmented Reality and Virtual Reality systems, doing some of the earliest prototyping of Augmented Reality systems and wearable computers. When the funding for that dried up, he joined Cray and resumed parallel computing research. Having finally found a stable source of research funding, he promptly worked himself out of it by prototyping a graph database system that Cray decided to productize. He now leads the development of Cray’s graph database product, the Cray Graph Engine. Currently he’s based in Cray’s Austin office.
David Mizell will be presenting the following session: LEBM: Making a Thoroughly Nasty Graph Database Benchmark.
(NEW) - Jean-Baptiste Musso (Paris) @jbmusso
Mo Patel (Austin) @mopatel
Mo Patel is a Senior Data Scientist at Think Big, A Teradata Company. Mo mentors clients across Americas on topics of Machine Learning, Data Science, Data Engineering and Artificial Intelligence. These mentoring engagements range from helping clients build large scale streaming analytics solution for deriving business value from sensor data to helping clients reduce customer churn and improve product stickiness via graph analytics. Mo is constantly evaluating the rapidly changing landscape of analytic libraries, tools, methods and frameworks in order to separate hype from reality for his clients. Current research interests are Sensor Data, Low-Latency Analytics, Deep Learning and Artificial Intelligence.
Mo has a Masters in Computer Science from Brandeis University, Bachelors in Math & Computer Science from College of the Holy Cross and MBA from Georgetown University. Mo is a Boston native, living in Austin and loves snow sports (not in Austin) and in order to maintain that addiction exercises and spends time outdoors (in Austin).
Mo Patel will be appearing as part of Graph Day 2017.
Josh Perryman (Bryan / College Station) @joshperryman
Josh Perryman likes to play with data. Oftentimes this is implementing proprietary algorithms closer to the data for performance or scale. Sometimes it is ad-hoc investigation and analysis, a sort of exploratory querying. A few times he’s been able to leverage his experience with data engines for dramatic performance improvements. But the real joy is designing a schema for both functionality and performance, one which increases the productivity of other developers and enables a technology to solve new problems or deliver new value to the business.
But technology isn't just data, and he does more than just play with data. He’s worked with high performance computing (HPC) environments, taking computations from hours to minutes or seconds. He has built visualizations which deliver new insights into complex data domains. He’s managed technology personnel, both directly and indirectly, to deliver technology solutions. He’s have put together more types of technology components, software and hardware, than can be counted, because one of his fortes is solving problems by building sustainable systems.
Josh Perryman will be appearing as part of Graph Day 2017.
Jason Plurad (Raleigh-Durham) @pluradj
Jason Plurad is a software developer on IBM's Open Technologies team. He is a committer on Apache TinkerPop, an open source graph computing framework. Jason engages in full stack development (including front end, web tier, NoSQL databases, and big data analytics) and promotes adoption of open source technologies into enterprise applications, service, and solutions. He has spoken previously at IBM conferences (Innovate, Insight) and Triangle Hadoop Users Group meetups.
(NEW) - Haikal Pribadi (London) @ haikalpribadi
Haikal Pribadi is the Founder of GRAKN.AI, a distributed knowledge graph. His interest in AI began in at the Monash Intelligent Systems Lab, where he created an open source driver for the Parallax Eddie Robot which allows user interaction through gesture and speech and was then adopted by NASA for research. He graduated top of his class in Computer Science at Monash University and obtained a masters degree in AI at the University of Cambridge on a full scholarship. Haikal then joined Quintiq as the youngest Algorithm Expert behind their Optimisation Technology R&D that helped schedulings of world’s largest companies. He now works on Grakn and Graql, the graph query language for deep network data.
Haikal Pribadi will be presenting the following session: How to Manage and Harness Large-Scale Graph Data with Grakn.
Dr. Juan Sequeda is the co-founder of Capsenta and the developer of Ultrawrap, a system that virtualizes relational databases as graph data sources. His research interests are on the intersection of Logic and Data and in particular between the Semantic Web and Relational Databases for data integration. Juan holds a Ph.D. in Computer Science from the University of Texas at Austin. Capsenta is a spin-off from his PhD research. Juan is the recipient of the NSF Graduate Research Fellowship, Best Student Paper at the 2014 International Semantic Web Conference, and 2nd Place in the 2013 Semantic Web Challenge for his work on ConstituteProject.org. Juan is on the editorial board of the Journal of Web Semantics and has been an invited expert member and standards editor for the World Wide Web Consortium (W3C) Relational Database to RDF Graph working group.
Juan Sequeda will be appearing as part of Graph Day 2017.
Check out our recent interview with Juan Sequeda.
(NEW) - Ted Wilmes @trwilmes
Ted Wilmes is passionate about learning complex systems top to bottom and he enjoys applying this knowledge to help customers with their data architecture and performance tuning needs. Over the past few years he has been involved in the rapidly growing graph database space and is an active committer and PMC member on the Apache TinkerPop project.
Ted Wilmes will be presenting the following session: Implementing Network Algorithms in TinkerPop's GraphComputer.
Chris Moody of Stitch Fix taking the audience in the weeds with recent NLP techniques at DDTX16.
Spark contributor Holden Karau packs the room at DDTX16.