Some of the featured topics at Data Day.
Here is a rough guide to the topics covered at Data Day. We hope to have the time and room numbers up in the next day or so.
This is not all of the talks. We are still waiting on the flight itinerary and the final abstracts for a few of the speakers.
A/B Testing
Nelson Ray (Open Door) : When A/B Testing Fails: A Case Study in Real Estate
Apache Cassandra
Michelle Casbon (Qordoba) : Untangling the Ball of Strings: Machine Learning for Localization
Dor Laor (ScyllaDB) : ScyllaDB dance like butterfly sting like bee
Patrick McFadin (DataStax) : Choose your own Time Series adventure
2:00pm - Patrick McFadin (DataStax) : Time for a new relation: Going from RDBMS to Graph
Apache Spark
Michelle Casbon (Qordoba) : Untangling the Ball of Strings: Machine Learning for Localization
Gergely Svigruha : Demography Estimation on a Large Telco Graph – a Case Study
11:30am - Mo Patel (Think Big (Teradata)) : Traversing our way through Spark GraphFrames and GraphX
5:10pm - Borislav Iordanov (GRAKN.AI) : Large Scale Graph Analytics Through Graql
Artificial Intelligence
Jonathan Mugan (Deep Grammar) : How to Progress from NLP to Artificial Intelligence
Data and the Public Sphere
Jonathon Morgan (New Knowledge) : Truth is Dead
9:50am - William Lyon (Neo4j) : Make Graphs Great Again - Analyzing Election Data Using Neo4j
Data Science
Tim Gasper : Robot farmers and chefs: In the field and in your kitchen
Ryan Mitchell (LinkeDrive, Inc) : Web Scraping in a JavaScript World
Melissa Santos (Big Cartel) : Distances, Similarities, and Scores: Practical Model Examples
Stefan Krawczyk (Stitch Fix) : Scaling Data Science at Stitch Fix
Michelle Casbon (Qordoba) : Untangling the Ball of Strings: Machine Learning for Localization
Nelson Ray (Open Door) : When A/B Testing Fails: A Case Study in Real Estate
Brendan Herger (Capital One) : Machine Learning with Opponents
9:50am - Juliet Hougland (Cloudera) : How to Observe: Lessons from Epidemiologists, Actuaries and Charlatans
Data Visualization
9:50am - Corey Lanum (Cambridge Intelligence) : Graphs in time and space: A visual example
1:10pm - William Lyon (Neo4j) : Neo4j Graph Database Workshop For The Data Scientist Using Python
2:00pm - Chris LaCava (Expero) : Meaningful User Experience with Graph Data
Data Wrangling
Ryan Mitchell (LinkeDrive, Inc) : Web Scraping in a JavaScript World
Databases
Dor Laor (ScyllaDB) : ScyllaDB dance like butterfly sting like bee
Paul Dix (Influx) : Towards an open standard for metrics and time series
Paul Dix (Influx) : Structured Logs with InfluxDB - Log management at scale without full text search
Data Ops
Laine Campbell (N/A) : Database Reliability Engineering
Deep Learning
Brendan Herger (Capital One) : Machine Learning with Opponents
Pierre Gutierrez (Dataiku) : Pragmatic Deep Learning for image labelling. An application to a travel recommendation engine
DevOps
Laine Campbell (N/A) : Database Reliability Engineering
Docker
Michelle Casbon (Qordoba) : Untangling the Ball of Strings: Machine Learning for Localization
Stefan Krawczyk (Stitch Fix) : Scaling Data Science at Stitch Fix
Graph Databases - Cypher
1:10pm - William Lyon (Neo4j) : Neo4j Graph Database Workshop For The Data Scientist Using Python
2:50pm - Juan Sequeda (Capsenta) : Graph Query Languages
Graph Databases - General
Ted Wilmes (Expero) : Time Series and Audit Trails: Modeling Time in an Industrial Equipment Property Graph
9:50am - Corey Lanum (Cambridge Intelligence) : Graphs in time and space: A visual example
9:50am - Juan Sequeda (Capsenta) : Do I need a Graph Database
10:40am - Dave Bechberger (Expero) : Moving Your Data To Graph
11:30am - Luca Garulli (OrientDB) : Graph Databases: what's next?
11:30am - Mo Patel (Think Big (Teradata)) : Traversing our way through Spark GraphFrames and GraphX
1:10pm - Denise Gosnell (PokitDok) : Graphs vs Tables: Ready? Fight.
2:00pm - Patrick McFadin (DataStax) : Time for a new relation: Going from RDBMS to Graph
2:00pm - Chris LaCava (Expero) : Meaningful User Experience with Graph Data
2:50pm - Alaa Mahmoud (IBM) : Building a Graph Database in the Cloud: challenges and advantages
2:50pm - Juan Sequeda (Capsenta) : Graph Query Languages
2:50pm - David Mizell (Cray) : LEBM: Making a Thoroughly Nasty Graph Database Benchmark
4:20pm - Haikal Pribad (GRAKN.AI) : How to Manage and Harness Large-Scale Graph Data with Grakn
5:10pm - Borislav Iordanov (GRAKN.AI) : Large Scale Graph Analytics Through Graql
Graph Databases - Multimodal
9:50am - Juan Sequeda (Capsenta) : Do I need a Graph Database
10:40am - Jason Plurad (IBM) : Enabling a Multimodel Graph Platform with Apache TinkerPop
11:30am - Luca Garulli (OrientDB) : Graph Databases: what's next?
2:50pm - Juan Sequeda (Capsenta) : Graph Query Languages
Graph Databases - Neo4j
9:50am - William Lyon (Neo4j) : Make Graphs Great Again - Analyzing Election Data Using Neo4j
10:40am - Ryan Boyd (Neo4j) : Graphs + Sensors = The Internet of Connected Things
1:10pm - William Lyon (Neo4j) : Neo4j Graph Database Workshop For The Data Scientist Using Python
1:10pm - Denise Gosnell (PokitDok) : Graphs vs Tables: Ready? Fight.
Graph Databases - Tinkerpop
10:40am - Jason Plurad (IBM) : Enabling a Multimodel Graph Platform with Apache TinkerPop
11:30am - Mike Downie (Expero) : Implementing Network Algorithms in TinkerPop's GraphComputer
2:50pm - Alaa Mahmoud (IBM) : Building a Graph Database in the Cloud: challenges and advantages
Graph Databases - Use Cases
Gergely Svigruha : Demography Estimation on a Large Telco Graph – a Case Study
Humans in the Loop
Nicholas Gaylord (Crowdflower) : Machine learning with humans in the loop
Gabor Melli : Predictive Models for Inter-Linking Text-Rich Semantic Databases
Image Recognition
Pierre Gutierrez (Dataiku) : Pragmatic Deep Learning for image labelling. An application to a travel recommendation engine
IoT / Sensors
Tim Gasper : Robot farmers and chefs: In the field and in your kitchen
Ted Wilmes (Expero) : Time Series and Audit Trails: Modeling Time in an Industrial Equipment Property Graph
10:40am - Ryan Boyd (Neo4j) : Graphs + Sensors = The Internet of Connected Things
Machine Learning
Nicholas Gaylord (Crowdflower) : Machine learning with humans in the loop
Brendan Herger (Capital One) : Machine Learning with Opponents
Tim Gasper : Robot farmers and chefs: In the field and in your kitchen
Michelle Casbon (Qordoba) : Untangling the Ball of Strings: Machine Learning for Localization
Natural Language Processing
Tim Gasper : Robot farmers and chefs: In the field and in your kitchen
Michelle Casbon (Qordoba) : Untangling the Ball of Strings: Machine Learning for Localization
Jason Kessler (CDK) : Scattertext: A Tool for Visualizing Differences in Language
Jonathan Mugan (Deep Grammar) : How to Progress from NLP to Artificial Intelligence
Rob McDaniel (Live Stories) : Bootstrapping a corpus: how to build a rich topic library from a handful of words
Jacob Su Wang (OjoLabs) : Exploring Modeling Methods in Named Entity Recognition
Sanghamitra Deb (Accenture) : Creating Knowledgebases from text in absence of training data
Gabor Melli : Predictive Models for Inter-Linking Text-Rich Semantic Databases
Jonathon Morgan (New Knowledge) : Truth is Dead
Neural Networks
Alex Korbonits (Remitly) : Distilling dark knowledge from neural networks
Brendan Herger (Capital One) : Machine Learning with Opponents
Ontologies
Gabor Melli : Predictive Models for Inter-Linking Text-Rich Semantic Databases
4:20pm - Haikal Pribad (GRAKN.AI) : How to Manage and Harness Large-Scale Graph Data with Grakn
Python
Ryan Mitchell (LinkeDrive, Inc) : Web Scraping in a JavaScript World
Jason Kessler (CDK) : Scattertext: A Tool for Visualizing Differences in Language
1:10pm - William Lyon (Neo4j) : Neo4j Graph Database Workshop For The Data Scientist Using Python
Recommender Systems
Pierre Gutierrez (Dataiku) : Pragmatic Deep Learning for image labelling. An application to a travel recommendation engine
Time Series Data
Patrick McFadin (DataStax) : Choose your own Time Series adventure
Paul Dix (Influx) : Towards an open standard for metrics and time series
Paul Dix (Influx) : Structured Logs with InfluxDB - Log management at scale without full text search
Ted Wilmes (Expero) : Time Series and Audit Trails: Modeling Time in an Industrial Equipment Property Graph
Training Data
Melissa Santos (Big Cartel) : Distances, Similarities, and Scores: Practical Model Examples
Alex Korbonits (Remitly) : Distilling dark knowledge from neural networks
Jacob Su Wang (OjoLabs) : Exploring Modeling Methods in Named Entity Recognition
Sanghamitra Deb (Accenture) : Creating Knowledgebases from text in absence of training data
Nicholas Gaylord (Crowdflower) : Machine learning with humans in the loop
Pierre Gutierrez (Dataiku) : Pragmatic Deep Learning for image labelling. An application to a travel recommendation engine