InsideBIGDATA: An Insider’s Guide to Apache Spark

By: TIBCO

As one of the most exciting and widely adopted open-source projects, Apache Spark and its in-memory clusters are driving new opportunities for application development as well as increased demand for IT infrastructure. Apache Spark is now the most active Apache project, with more than 600 contributions made in the last 12 months by more than 200 organizations. A new survey of 1,417 IT professionals working with Apache Spark, conducted by Databricks, finds that high-performance analytics applications that can work with big data are driving a large proportion of that demand. Apache Spark is now being used to aggregate multiple types of data in memory, rather than only pulling data from Hadoop. For solution providers, the Apache Spark technology stack is significant because it is one of the core technologies used to modernize data warehouses, a huge segment of the IT industry that accounts for billions in revenue.

Spark also holds much promise for the future of data lakes: storage repositories that hold vast amounts of raw data in its native format until it is needed. With Spark’s speed and scalability, data lakes can offer the enterprise a framework with virtually unlimited capacity.

Published:  Nov 09, 2015
Length:  9
Type:  White Paper