Personal schedule for Tao Cheng
Download or
subscribe to Tao Cheng's
schedule.
MySQL Performance Schema is a new performance analyzes tool in MySQL 5.5, learn how to use it for Performance Optimization tasks.
Read more.
Apache Hadoop is a distributed, batch-processing system for large data sets. It can be used alongside relational databases to enable more effective reporting and injestion of large amounts of raw or unstructured data.
Read more.
Getting sharding right is crucial for achieving high scale with MySQL on commodity hardware like we do at Facebook. We will overview sharding best practices, and show some examples of both successful and unsuccessful methods at sharding MySQL.
Read more.
Production schema changes are painful but unavoidable. This session will tell you how to
minimize (or totally eliminate) downtime during schema changes with master-master setup or by using "shadow" tables.
Session will focus on pros and cons of each approach and describe most common use cases.
Read more.
It takes a lot to deliver consistent high performance for your MySQL powered system. In this presentation we'll look at defining Performance Goals, understanding Architecture Scalability and performing Capacity Planing.
Read more.
NoSQL
Location: Ballroom E
HBase is an open source column store built on top of Hadoop.
In this 45 minute session you will get a brief introduction into the design of HBase, and the underlying framework along with some usage examples.
Read more.
NoSQL
Location: Ballroom F
MongoDB -- from "humongous" -- is an open source, non-relational, document-oriented database.
Read more.
Running MySQL at a scale of Facebook leads to many unique problems. We will
discuss some of the performance problems seen in production systems and the tools and
techniques involved in solutions.
Read more.
While MySQL and MongoDB often fight over the same deployment, there are many cases where MySQL and MongoDB should be used in conjunction.
They each excel at different things, so its important to understand when to use one or the other, and how to make them work well together.
Eliot Horowitz is the CTO and Co-Founder of 10gen, the creators of MongoDB
Read more.
Apache CouchDB implements a reliable storage engine, webserver, and HTTP application server environment, in under 20k lines of Erlang and JavaScript source code (with an additional X lines of test code.)
I'll show 3 related examples that strike at the core of CouchDB's simplicity: The storage engine, the incremental map reduce views, and replication.
Read more.
GitHub's history with MySQL and what we've built off of it.
Read more.
AOL deployed its large scale Real Time News (RTN) system in 2007. This system receives news updates from over 30,000 sources on every second around the clock. Today, its data store, MySQL, has accumulated over several billions of rows and terabytes of data. However, news are delivered to end users in close to real time fashion. This presentation shares how it is done and the lessons learned.
Read more.
A real-world example of how re-sharding and table partitioning cut load data times in Facebook's analytics infrastructure from greater than 24 hours to less than 5 minutes.
Read more.
The “NoSQL” movement is typically related to key-value systems and, lacking a formal definition, can be interpreted many ways. NoSQL discussions that focus on availability and scalability highlight ACID issues but not really SQL.
The key-value systems can be built many ways and relational databases as a back end is a serious contender.
Read more.
With contemporary web applications, data is never isolated to one store. Memcached has long been a partner to MySQL; now Membase, a persistent, replicated, clustered memcached-protocol-compatible datastore is used alongside MySQL for simple, fast key-value access. This session will dispel the idea of needing to choose between SQL or NoSQL, showing how you can be both rich and fast.
Read more.
The last few years have brought technological and market shifts that have disrupted open-source databases. These include cloud computing, solid-state storage, non-SQL databases, and MySQL's acquisition. In this keynote presentation, Baron Schwartz will discuss the new reality
that faces open-source database users and developers.
Read more.
The meteoric growth of MySQL through the 1990s and 2000s were marked by some big in the enterprise database market -- a willingness to adopt open source software for critical business applications, and the emergence of a new class of database-backed web applications that needed a simpler, cheaper and more flexible storage model than the established vendors provided.
Read more.
Antony shows the OQGRAPH engine in MariaDB 5.2, allowing you to deal with graphs (networks) for social networking, hierarchies and other complex structures, all in plain clean SQL.
Read more.
NoSQL
Location: Ballroom A
People talk about NoSQL in the context of distributed cloud-based web applications, but what if your application needs to be deployed throughout rural Africa, with limited computer resources, intermittent power, and above all, extremely unreliable internet? This talk discusses the features of CouchDB that make it uniquely suited for developing world health applications.
Read more.
ScaleDB is a pluggable storage engine for MySQL. It turns MySQL into an enterprise-class, highly-available, clustered database that scales dynamically in a public and private cloud.
Read more.
Most MySQL deployments use some form of replication. Whether the reason is availability, scalability, backup, disaster recovery or archiving, understanding the performance characterization and trade-offs is critical to design and planning.
If you have always wondered when and why to choose async/semi-sync/sync MySQL replication, DRBD or other interoperable technologies, this session is for you.
Read more.
The Facebook database engineering team works with the community and on its own to make MySQL better for data center deployments. This work is visible in the Facebook patch, bugs fixed in official MySQL and features sponsored in other distributions. We will describe work to support a large number of large databases. We focus on backup, replication and quality of service.
Read more.