HBaseCon 2015 Speakers

(Continuously updated as speakers confirm)


Seshu Adunuthula - eBay
Seshu is Director of Analytics Platform at eBay, responsible for managing some of the world's largest deployments of Hadoop, Teradata, and ETL Ingest infrastructure. He is an industry veteran with over 20 years of distributed computing and analytics experience. Most recently prior to eBay, he worked at MapR, responsible for the MapReduce, MapR-DB, and MapR Control System teams.
Nitin Aggarwal - Rocket Fuel
Nitin is a Software Engineer at Rocket Fuel where he builds data applications using Apache HBase, MapReduce, YARN, and Apache Storm to enable faster access and easier analysis of petabytes of data. He has also contributed to developing scalable monitoring and alerting infrastructure for the company using HBase and OpenTSDB.
Jesse Anderson - Smoking Hand
Jesse is a Creative Engineer with years of experience in creating products and helping companies improve their software engineering. He strives to provide developers with the resources to learn new technologies and improve their skillsets. To help the local community, he volunteers time as the President of the Northern Nevada Software Developers Group.
Vandana Ayyalasomayajula - Yahoo!
Vandana is a software engineer at Yahoo!. She currently works for the HBase team building various multitenancy features for HBase. She is a contributor to HBase and was also a PMC member and Committer for Apache HCatalog. Prior to Yahoo!, Vandana was a graduate student at UC Irvine.
Clay Baenziger - Bloomberg
Clay leads the Hadoop Infrastructure team at Bloomberg. Clay comes from a diverse background in systems infrastructure and analytics. At Sun Microsystems, his team built out an automated bare-metal Solaris deployment tool for Solaris engineering labs and his contributions were core to the OpenSolaris Automated Installer.
Matteo Bertozzi - Cloudera
Matteo Bertozzi is a Software Engineer at Cloudera, and an HBase committer/PMC member.
Matt Blair - Flipboard
Matt is a Software Engineer at Flipboard and has worked on infrastructure there since 2011, with a recent focus on building backend services that leverage its data pipeline and various distributed databases, including HBase.
Sean Busbey - Cloudera
Sean is a committer on the Apache HBase project. He is also a PMC member on Apache Accumulo. He currently works at Cloudera as a Software Engineer on the storage team.
Aaron Carreras - FINRA
Aaron is director of enterprise data at FINRA. His team works on all aspects of data coming in and out of FINRA. For the past two years, his focus has primarily been on design, development, and rollout of FINRA's first two HBase-backed applications in the cloud. He has spent his entire career working on applications and data in the Finance space.
Jeremy Carroll - Pinterest
Jeremy is one of the foundational members of the Site Reliability Engineering team at Pinterest. He helps design, build, and monitor Pinterest's applications and systems infrastructure that currently handles billions of monthly page views with tremendous growth and scalability challenges.
Poorna Chandra - Cask
Poorna is a Software Engineer at Cask where he is responsible for building software fueling the next generation of data applications. Prior to Cask, he developed Big Data infrastructure at Greenplum and Yahoo!
Tian-Ying Chang - Pinterest
Tian-Ying is currently working in the storage and caching infrastructure at Pinterest; she also built the Disaster Recovery framework for HBase there and is responsible for HBase performance and reliability. Prior to Pinterest, she worked at eBay and Microsoft on Hadoop/HBase, Windows, and Halo.
Ishan Chhabra - Rocket Fuel
Ishan is a Technical Lead at Rocket Fuel, with a focus on building the next generation of real-time storage and processing systems. Hadoop, HBase, Storm and Clojure are his tools of choice for tackling complexity and scalability challenges of storing and analyzing petabytes of data generated and stored at Rocket Fuel. Prior to Rocket Fuel, he worked at Bell Labs to enable privacy in large-scale recommendation systems using distributed middleware, acquiring a patent in the process.
Sang Chi - Flipboard
Sang is currently running the data and search platform at Flipboard, focused on running scalable data solutions/infrastructure for analytics, products and search. He has used HBase since 2010 across user graph, magazine storage, metrics, personalized feed, feature extraction, ranking, ads, and more.
Taejin Chin - DaumKakao
Taejin is a Software Engineer at DaumKakao. He built a distributed graph database for social graph data. He is also interested in graph theory, applied algorithms, and problem solving.
Elliott Clark - Facebook
Elliott is an engineer at Facebook on the Apache HBase team. He's also an HBase PMC member and committer.
Dave Coyle - Dropbox
Dave was the first Hadoop SRE at Dropbox, which has a small team focusing on HDFS and HBase operations and reliability. Prior to that, he worked on Hadoop and other systems at Spotify, Morgan Stanley, and other companies.
Jianwei Cui - Xiaomi
Jianwei is a software engineer at Xiaomi in China. His work focuses on the development and improvement of Apache HBase.
Jason Culverhouse - Flipboard
Jason Culverhouse is a Software Engineer at Flipboard.
Nick Dimiduk - Hortonworks
Nick is an Apache HBase committer, PMC member, and a co-author of HBase in Action. He works on the HBase team at Hortonworks, where his focus is on usability and performance.
Jingcheng Du - Intel
Jingcheng works for the Intel Big Data Team as a Senior Software Engineer. He has worked on developing HBase features since 2012 and is also an HBase contributor.
Solomon Duskis - Google
Solomon has been working on HBase since October 2014 and has focused on the HBase 1.0 client standardization efforts. He works for the Google Bigtable team.
Venkata Deepankar Duvvuru - Rocket Fuel
Venkata is a Software Engineer at Rocket Fuel where he builds large-scale data and serving applications using Apache HBase, MapReduce, Thrift, and Clojure. In the past year, he has worked on JVM tuning for providing better guarantees in latencies while serving. Prior to Rocket Fuel, he interned at Google and INRIA.
Abraham Elmahrek - Cloudera
Abraham is a Software Engineer at Cloudera. He is a member of the Apache Sqoop PMC and a committer on the Apache HTrace (incubating) project.
Ian Friedman - Yahoo!
Ian is a Senior Software Engineer on Flurry's Platform Team. He works primarily on Flurry's data ingestion and metrics aggregation pipeline, which continuously processes over 20 TB of mobile analytics event data per day. He also helps manage and troubleshoot Flurry's 2,000+ node Hadoop/HBase cluster.
Rahul Gidwani - Yahoo!
Rahul is an engineer on the platform team at Flurry/Yahoo!. For the past few years, he has been working with HBase and scaling Flurry's cluster.
Arnab Guin - GE
Arnab is a Staff Software Engineer, Big Data with General Electric's Predix Big Data Platforms group. His work focuses on developing and designing platforms encompassing high-speed ingestion, storage, and analytics. Prior to GE, Arnab worked on distributed genome sequencing algorithms at Complete Genomics and developed high-speed data pipelines at Tivo for high-volume viewership data.
Andrey Gusev - Sift Science
Andrey is ML infrastructure tech lead at Sift Science and enjoys machine learning, search, NLP, and distributed systems. Before Sift, Andrey was a lead engineer at Salesforce.com working on search and machine learning systems.
Gary Helmling - Cask
Gary is a Committer and PMC member for the Apache HBase project. He works on HBase and Apache Hadoop development at Cask (formerly Continuuity), and has contributed to security, coprocessors, and the RPC stack. In past roles, Gary has worked at Twitter, Trend Micro, and Meetup.
Lars Hofhansl - Salesforce.com
Lars is an Apache HBase Committer and PMC member. He is an Architect at Salesforce.com, where he leads HBase development efforts, recently focusing on performance, backup, and disaster recovery. In the past, Lars held engineering roles at Peoplesoft and Digital Equipment Corp.
Jonathan Hsieh - Cloudera
Jonathan is a Software Engineer with Cloudera, currently focused on the HBase project. He is an HBase committer and PMC member, a committer and founder of the Apache Flume project, and a committer on the Apache Sqoop project.
Matthew Hunt - Bloomberg
Matthew works on systems architecture for Bloomberg and its Portfolio Analytics product, which comprises real time and historical analytics for returns, risk, optimization, and attribution. He has been lucky enough to have been the CTO at several startups, and served as the president of LUNY!, the Linux Users of New York.
Julian Hyde - Hortonworks
Julian, an architect at Hortonworks, is an expert in database architecture, query optimization, and in-memory analytics. He is the original developer of the Apache Calcite query-planning framework, an Apache Drill committer, and lead developer of the Mondrian OLAP engine.
Rohit Jain - Hewlett-Packard
Rohit, Database Distinguished & Chief Technologist at HP, leads an effort to build Big Data Apache Hadoop solutions while leveraging Apache HBase. He has also served as a solutions architect and a database consultant, developer, architect, development and QA manager, and product manager.
Dr. Ricardo Jimenez-Peris - LeanXcale
Ricardo is CEO and cofounder of LeanXcale. He is an expert on scalable transactions, co-author of a book on scalable database replication and of 100+ papers in international conferences and journals, and co-inventor of several patents. He is a member of the expert group advising the European Commission on Cloud Computing.
Eric Kaczmarek - Intel
Eric is a Senior Java Performance Architect in the Software Solution Group at Intel. For the better part of the last 10 years, he focused on optimizing the Java Virtual Machine for Intel Architectures. Because of his deep and broad Java Virtual Machine expertise, Eric leads the effort to enable and optimize Big Data frameworks such as Apache Hadoop and Apache HBase for Intel-based platforms.
Sudarshan Kadambi - Bloomberg
Sudarshan is an Architect at Bloomberg helping evolve Bloomberg's Data and Compute infrastructure. He has a background in distributed systems from his days at Stanford and Yahoo!. He has been a user of Hadoop since 2008 and is passionate about making it awesome.
Hirotaka Kakishima - CyberAgent
Hirotaka is a database engineer at CyberAgent. He has administered HBase clusters for two years. He is a co-author of Beginner's Guide to HBase (Japanese language), which was released through Shoeisha in 2015.
Ido Karavany - Intel
Ido is a Big Data Analytics Architect and Development Manager in Intel's Advanced Analytics group. He is responsible for leading-edge technology projects within Intel involving Big Data and stream analytics solutions in the Internet of Things and Parkinson's disease research.
Virag Kothari - Yahoo!
Virag works for the HBase team at Yahoo!, where his current focus is on challenges related to scalability and multitenancy. He is an HBase committer and a committer/PMC member for Apache Oozie.
Swarnim Kulkarni - Cerner
Swarnim is a Lead Architect with the Big Data team at Cerner Corporation. At Cerner, his team is focused on the design and development of infrastructure for ingestion of healthcare data in the cloud using Apache Hadoop technologies. He is also a contributor to Apache Hive.
Chris Larsen - Yahoo!
Chris is a software engineer at Yahoo! working on the monitoring team to store and process time-series data at a massive scale. He coordinates development on OpenTSDB and AsyncHBase with a great community of users and contributors. Previously, he helped publish OpenTSDB 2.0 while working at Limelight Networks.
John Leach - Splice Machine
With over 15 years of software experience under his belt, John's expertise in analytics and BI drives his role as CTO. Prior to Splice Machine, John founded Incite Retail and led the company's strategy and development efforts. Prior to Incite Retail, he ran the business intelligence practice at Blue Martini Software and built strategic partnerships with integration partners.
Cosmin Lehene - Adobe
Cosmin is a senior computer scientist in Adobe's Analytics Platform team, working on distributed infrastructure for the Adobe Marketing Cloud. His past work includes a real-time distributed OLAP cube on top of HBase, a real-time video QoS analytics service, and Adobe Analytics Video Heartbeats.
Jimmy Lin - University of Maryland
Jimmy is an Associate Professor at the University of Maryland. From 2010-2012, he spent an extended sabbatical at Twitter working on analytics infrastructure and various data products.
Francis Liu - Yahoo!
Francis is a Principal Software Engineer at Yahoo! working mainly on Apache HBase. He is also an Apache Hive contributor. Prior to that, he was involved in the development of a workflow management and incremental processing platform built on top of Apache Hadoop.
Shaohui Liu - Xiaomi
Shaohui is interested in distributed computing and storage systems. Currently, he focuses on the application and operation of HBase at Xiaomi. Prior to Xiaomi, he worked on an in-house MapReduce implementation and cluster management system at Tencent.
Xun Liu - Pinterest
Xun is a software engineer on the infrastructure team at Pinterest. He has worked in many areas and is currently focusing on storage and caching solutions. Before Pinterest, Xun was a staff software engineer at Google and worked on display ads and search quality.
Max Luebbe - Google
Max is a Site Reliability Engineer at Google's New York City office. In this role he is responsible for running a handful of services you probably use every day, specifically with regards to their availability and reliability. Prior to working at Google, he cofounded Pip.io, a social web startup in Palo Alto, CA.
Maxim Lukiyanov - Microsoft
Maxim is a program manager on the Big Data team at Microsoft. He is responsible for the Apache HBase cluster type in Azure HDInsight, focusing primarily on optimizing HBase for cloud environments.
David MacKenzie - Box
David is a Staff Software Engineer at Box, where he's spent the past three years working on the infrastructure powering the company's desktop sync experience. He's currently building out Box's new HBase-backed guaranteed-delivery messaging infrastructure. Prior to Box, David worked at a small mobile telecom company building 3G network switches.
Ted Malaska - Cloudera
Ted is a Solutions Architect at Cloudera. He has 18 years of professional experience working for startups, the U.S. Federal Government, some of the world's largest banks, and the U.S.'s largest non-profit financial regulator. Ted is a regular contributor to Apache Flume, Apache Avro, Apache Pig, and YARN.
Colin McCabe - Cloudera
Colin is a Platform Software Engineer at Cloudera, where he works on HDFS and related technologies. He is a committer on HDFS. Prior to joining Cloudera, he worked on the Ceph Distributed Filesystem and the Linux kernel, among other things. He studied Computer Science and Computer Engineering at Carnegie Mellon.
Jacques Nadeau - MapR
Jacques is MapR's lead developer on the Apache Drill open source project. He is an industry veteran with over 15 years of big data and analytics experience. Most recently, he was cofounder and CTO of search engine startup YapMap. Before that, he was director of new product engineering with Quigo (contextual advertising, acquired by AOL in 2007). He also built the Avenue A | Razorfish analytics data warehousing system and associated services practice (acquired by Microsoft).
Shyam Varan Nath - GE
Shyam is a Big Data & Analytics Architect working at GE. His primary focus is Industrial Internet-related solutions for aviation. Prior to GE, Shyam worked for IBM, Oracle, and Deloitte. He has over 23 years of industry experience in areas like data warehousing and advanced analytics.
Brock Noland - StreamSets
Brock is an engineer at StreamSets, an Apache Flume, Hive, Crunch, MRUnit, and Parquet (incubating) PMC member, and a mentor to Apache Nifi (incubating). Prior to StreamSets, he was an engineering manager at Cloudera.
Carter Page - Google
Carter Page is an engineer and manager on the Bigtable development team at Google in New York City. For the last 19 years, Carter has worked on high-performance distributed software across several industries, including media, finance, and education.
Joey Parsons - Flipboard
Joey is on the operations team at Flipboard.
Raghavendra Prabhu - Pinterest
Raghavendra Prabhu (aka RVP) manages the infrastructure team at Pinterest, which is responsible for core backend infrastructure including storage systems, caching, service framework, and core business logic. Prior to Pinterest, RVP worked for many years on storage and search infrastructure at Twitter, Google, and Microsoft.
Andrew Purtell - Salesforce.com
Andrew is a committer and PMC Chair for the HBase project, and is an Architect at Salesforce.com working on cloud storage. Previously, Andrew worked at Intel, Trend Micro, Sparta, and McAfee.
Anoop Sharma - Hewlett-Packard
Anoop is the lead Architect for the Trafodion program. He has worked in the areas of database technologies for many years at HP and has led design, development, and performance improvements for multiple database products.
Benoît Sigoure - Arista Networks
Benoît is the creator of OpenTSDB and AsyncHBase, although OpenTSDB has been largely maintained by Chris Larsen since the 2.0 release. Benoît currently works on new distributed systems at Arista Networks, where HBase plays a central role.
Enis Söztutar - Hortonworks
Enis is a Member of the Technical Staff at Hortonworks, an Apache HBase, Apache Hadoop, and Apache Gora committer, and a member of the Apache Software Foundation. He has been using and developing Hadoop ecosystem projects since 2007.
Michael Stack - Cloudera
Michael is an engineer on Cloudera's HBase team. He was the first project chair for HBase and is currently a committer/PMC member for that project, as well as a member of the Hadoop PMC.
Misty Stanley-Jones - Cloudera
Misty is a senior technical writer at Cloudera, working on documentation for Apache HBase, CDH, and other storage-related projects. She has been heavily involved in Linux and open source since 1996. In past lives, she managed the middleware technical writing staff at Red Hat, wore a sysadmin hat for several years, and hacked the Solaris kernel for a while.
Alan Steckley - Salesforce.com
Alan is a Principal Software Engineer at Salesforce. He works with Hadoop and HBase to build Marketing Cloud platform services.
Toshihiro Suzuki - CyberAgent
Toshihiro joined CyberAgent in 2008. He is in charge of a log analysis system using Apache Hadoop and Apache Hive and a graph database built on Apache HBase. He is co-author of Beginner's Guide to HBase (Japanese language), which was released through Shoeisha in 2015.
James Taylor - Salesforce.com
James is an architect at Salesforce.com in the Big Data Group. He founded the Apache Phoenix project and leads the development effort. Prior to working at Salesforce.com, James worked at BEA Systems on projects such as a federated query processing system and an event driven programming platform and has worked at various other start-ups in the computer industry over the past 20+ years.
Jimmy Xiang - Cloudera
Jimmy is a Software Engineer at Cloudera, and an HBase committer/PMC member.
Maryann Xue - Intel
Maryann is a software engineer on the Big Data Technologies team at Intel and a PMC member of the Phoenix project.
Liqi Yi - Intel
Liqi is a senior Java Performance engineer at Intel's Software Solution Group. He has extensive experience with HBase performance optimization, Java Garbage Collection tuning, and hardware platform characterization.
Doyung Yoon - DaumKakao
Doyung started his career at Google as a Software Engineer, and has worked for years on search engines and data mining. These days, he's fascinated by large-scale distributed systems.
©2014 HBaseCon. Cloudera, Inc. All rights reserved. Terms & Conditions. Apache HBase, HBase, Apache Hadoop, and Apache are either registered trademarks or trademarks of the Apache Software Foundation in the United States and/or other countries, and are used with permission. The Apache Software Foundation has no affiliation with and does not endorse, or review the materials provided at this event, which is managed by Cloudera.