Revolution Analytics Targets R Language, Platform at Growing Need to Handle 'Big Data' Crunching

Updated: August 04, 2010

With RevoScaleR, we've focused on making analytical models not just scale to the big data sets, but run the analysis in a fraction of the time compared to traditional systems," says David Smith, vice president of Community and Marketing at Revolution Analytics. "For example, the FAA publishes a data set that contains every commercial airline take off and landing between 1987 and 2008. That's more than 13 gigabytes of data. By analyzing that data, we can figure out the likelihood of airline delays in one second."

A rows-and-columns approach

One second to analyze 13 GB of data should turn some heads because it takes 300 seconds with traditional methods. Under the hood of RevoScaleR is rapid fire access to data. For example, the RevoScaleR uses an XDF file format, a new binary big data file format with an interface to the R language that offers high-speed access to arbitrary rows, blocks and columns of data.

We've taken that one step further to develop a system that accesses the database by rows and columns at the same time



"The new SQL movement was all about going from relational databases to a flat file on a disk that offers fast to access by columns. A lot of the technology that's behind things like Twitter and Facebook take this approach," Smith said. "We've taken that one step further to develop a system that accesses the database by rows and columns at the same time, which is really well-attuned to doing these statistical computations."

RevoScaleR also relies on a collection of the most-common statistical algorithms optimized for big data, including high-performance implementations of summary statistics, linear regression, binomial logistic regression and crosstabs. Data reading and transformation tools let users interactively explore and prepare large data sets for analysis. And, extensibility lets expert R users develop and extend their own statistical algorithms.

Featured Research
  • Eight Ways You Should Be Using Contact Center Reporting

    Every day, your contact center collects critical data that can be used to drive strategic improvements to your efforts in the future. But that data is meaningless if you don’t know how to access and analyze it. The key to do doing both is using reporting features. By understanding how to use reporting tools, you will gain much greater insight from the data you are collecting. more

  • Is Your Phone System Stealing Profits?

    Having the wrong phone system can dramatically cut into your profits. Despite this, many businesses just sign up for a plan or platform that seems ‘good enough’. If you haven’t carefully considered your options and the included features, there’s a very good chance that you are leaving money on the table in some way. more

  • Best Video Conferencing Features for Business

    Most businesses are currently underutilizing their video conferencing software because they aren’t aware of the different ways it can be used. Understanding the different features of video conferencing software can be critical to getting the most out of your investment. These features often vary from one option to the next as well, so it's important to do your homework before choosing a specific service. more

  • Phone System Technology Showdown

    VoIP and IP telephony are often misconstrued as being the same type of phone system, but the truth is they operate on different technology and deployment methods. This guide will explain the differences between VoIP and IP, go into the pros and cons of both VoIP and IP-PBX, and give insight into which type of phone system will benefit your business the most. more

  • 8 Ways the Cloud is Changing ERP for the Better

    What if there was a tool available that allowed for you to save up to a quarter of your operational costs? Studies have shown that Enterprise Resource Planning (ERP) solutions enable businesses to access accurate, real-time information about daily operations which allow for the reduction of operational costs of up to 23% and administrative costs of up to 22%. more