Happy New Year, everyone! Here at ONEsite, we constantly strive to increase performance and improve the experience of our clients and their end users. Last summer, we targeted the Message Center as a key system that could use optimization. The Message Center serves end users as a private messaging system; it is also a key way for network administrators to communicate with their users, either through our control panel interfaces or our APIs. As such, the Message Center is both a popular user feature and one that must scale to millions of recipients for a single message.
One of the major architectural pieces of our platform that contributes to our scalability is the functional partitioning of key systems and logical sharding onto isolated, highly available (HA) database clusters. When developing a new system, isolating its backend storage is trivial: define a pool and start storing data. However, as was the case with the Message Center, migrating data from one cluster to a new one without downtime can be rather tricky.
Through our analysis, we identified the following issues:
- Message data accounted for nearly 50% of the size of one database cluster
- Duplication of large message bodies accounted for rapid, unnecessary growth
- Key index optimizations were missing
Based on this, we set the following goals:
- Isolate the message system into its own cluster
- Optimize the schema to reduce table locking
- Reduce the data size and duplication of data
- Accomplish the migration with little or no downtime
I’m pleased to report that after a long process of conversion, optimization, and testing, we have officially switched to the new backend for the Message Center. While the process proved quite tedious (and time consuming given the initial size of the database; some alters took several hours to run!), the end result has been very positive. Over the first few days we have seen our overall database load decrease and the Message Center get noticeably faster. For those of you just wanting a high-level overview:
We moved data around and changed it to increase performance and it worked. Thanks for reading, drive home safe!
For those wanting the nitty-gritty details, here you go:
We started by dumping the schema only and all of the raw data, in SQL format, as two separate dumps. From there, we restored the schema to another server and began optimizing the table structures, driven by code analysis and heavy use of EXPLAIN (see the sketch after the stats below). Once the initial alters were done, we loaded that schema, combined with the original data, onto empty MySQL 5.0 and 5.1 servers. At that point, we had the following rough statistics:
- Raw SQL: 87 GB
- Ibdata file: 134 GB
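To give a flavor of the index work, here is a minimal, hypothetical sketch of how EXPLAIN drove the alters. The table and column names below (message_recipients, recipient_user_id, folder, sent_date) are illustrative placeholders, not our actual schema:

```sql
-- EXPLAIN revealing a lookup that scans far more rows than it returns:
EXPLAIN
SELECT message_id, subject, sent_date
FROM   message_recipients
WHERE  recipient_user_id = 12345
  AND  folder = 'inbox'
ORDER  BY sent_date DESC;

-- The fix: a composite index that matches the WHERE clause and the sort,
-- so the lookup becomes an index range scan instead of a full table scan.
ALTER TABLE message_recipients
  ADD INDEX idx_recipient_folder_date (recipient_user_id, folder, sent_date);
```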
We then fired up replication (using some table limiting and schema rewrite magic) and watched. After about a month of monitoring and analysis, we decided to go with MySQL 5.1 running with innodb_file_per_table (one ibdata file per table). This was chosen primarily for the ability to spread disk IO across multiple files; side by side, the difference in IO wait between a single ibdata file and per-table files was quite evident. Once we had landed on a server version, the data conversion began.
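The exact replication setup isn't shown here, but a rough sketch of that "table limiting and schema rewrite magic" on a MySQL 5.1 candidate looks something like the following. Database, table, host, and credential names are placeholders, not our real values:

```sql
-- Slave-side startup options (my.cnf), shown as comments because they
-- cannot be set from SQL in 5.1:
--   innodb_file_per_table  = 1                      -- one ibdata file per table
--   replicate-rewrite-db   = core_db->message_db    -- schema rewrite
--   replicate-do-table     = message_db.message     -- table limiting
--   replicate-do-table     = message_db.message_recipient

-- Then point the candidate at the production master and start replicating:
CHANGE MASTER TO
  MASTER_HOST     = 'prod-master.example.internal',
  MASTER_USER     = 'repl',
  MASTER_PASSWORD = '********',
  MASTER_LOG_FILE = 'mysql-bin.000123',
  MASTER_LOG_POS  = 4;
START SLAVE;
```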
The main change was deduplicating message bodies and compressing them (if a user sends the same message twice, there is no need to store two copies of it). We wrote a custom job that read the body of every message record, stored a single compressed copy keyed by a hash of the body, and updated the message table to carry only that hash. The savings here were amazing at first glance (although the job took roughly a week to run to completion). We then wrote and ran a few other custom jobs to populate some of the new fields we added during our analysis (these were mainly columns copied from one table to another to eliminate joins; think speed here) and to prune old data.
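The custom job itself was application code run incrementally over that week, but the core of the deduplication can be sketched in plain MySQL. The table and column names (message, message_body, body_hash) are hypothetical, and this single-pass form ignores the batching the real job needed:

```sql
-- One row per distinct body, keyed by a hash and stored compressed.
CREATE TABLE message_body (
  body_hash CHAR(40)   NOT NULL PRIMARY KEY,  -- SHA1 of the raw body text
  body_zip  MEDIUMBLOB NOT NULL               -- COMPRESS()'d body, stored once
) ENGINE=InnoDB;

-- Store each distinct body exactly once, compressed.
INSERT IGNORE INTO message_body (body_hash, body_zip)
SELECT SHA1(body), COMPRESS(body)
FROM   message;

-- Point every message at its shared body (body_hash being a column added
-- during the earlier schema work); the raw body column is dropped later.
UPDATE message SET body_hash = SHA1(body);

-- Reading a message back joins to the shared, compressed copy:
SELECT UNCOMPRESS(b.body_zip) AS body
FROM   message m
JOIN   message_body b ON b.body_hash = m.body_hash
WHERE  m.message_id = 42;
```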
At this point, it was time to use MySQL's built-in replication to our advantage. We updated our code to write certain tables to both the old and the new database and stopped replicating those tables, while replication continued to handle writes for the rest. This kept our coding changes minimal. Once that was in place, we ran alters on some tables to drop the now-unused columns. With our schema finalized, we started replaying all of the read traffic from our production databases against the new cluster. Once we were satisfied with the performance, it was time to pull the trigger: we deployed our final code and severed replication from the original shared cluster, isolating all message traffic to an independent HA cluster.
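As an illustration of that cleanup step, again using the hypothetical names from the sketches above, dropping the now-redundant raw body column and excluding a dual-written table from replication would look roughly like this:

```sql
-- The raw text now lives compressed in message_body, so the original
-- column on the (large) message table is dead weight.
ALTER TABLE message DROP COLUMN body;

-- Each dual-written table was excluded from replication on the new
-- cluster via a slave-side startup option, one entry per table, e.g.:
--   replicate-ignore-table = message_db.message
```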
Here are our final stats after pruning and compressing, along with a load graph showing the decrease in server load on our core cluster:
- Ibdata files (total): 54 GB (roughly 40% of the original size)
This undertaking was quite a challenge that taught us several lessons along the way, but was entirely worthwhile. Hopefully, as we continue to refine and optimize our application, this process can serve as a model for our future projects.