apache spark

Results 1 - 23 of 23Sort Results By: Published Date | Title | Company Name
By: TIBCO     Published Date: Nov 09, 2015
As one of the most exciting and widely adopted open-source projects, Apache Spark in-memory clusters are driving new opportunities for application development as well as increased intake of IT infrastructure. Apache Spark is now the most active Apache project, with more than 600 contributions being made in the last 12 months by more than 200 organizations. A new survey conducted by Databricks—of 1,417 IT professionals working with Apache Spark finds that high-performance analytics applications that can work with big data are driving a large proportion of that demand. Apache Spark is now being used to aggregate multiple types of data in-memory versus only pulling data from Hadoop. For solution providers, the Apache Spark technology stack is a significant player because it’s one of the core technologies used to modernize data warehouses, a huge segment of the IT industry that accounts for multiple billions in revenue. Spark holds much promise for the future—with data lakes—a storage repo
Tags : 
     TIBCO
By: Pure Storage     Published Date: Dec 05, 2018
With the growth of unstructured data and the challenges of modern workloads such as Apache Spark™, IT teams have seen a clear need during the past few years for a new type of all-flash storage solution, one that has been designed specifically for users requiring high levels of performance in file- and object-based environments. With FlashBlade™, it addresses performance challenges in Spark environments by delivering the consistent performance of all-flash storage with no caching or tiering, as well as fast metadata operations and instant metadata queries.
Tags : 
     Pure Storage
By: Pure Storage     Published Date: Jan 12, 2018
Apache Spark has become a critical tool for all types of businesses across all industries. It is enabling organizations to leverage the power of analytics to drive innovation and create new business models. The availability of public cloud services, particularly Amazon Web Services, has been an important factor in fueling the growth of Spark. However, IT organizations and Spark users are beginning to run up against limitations in relying on the public cloud—namely control, cost and performance.
Tags : data, storage, scalability, cost efficiencies, pure storage
     Pure Storage
By: TIBCO Software APAC     Published Date: Aug 15, 2018
TIBCO Spotfire® Data Science is an enterprise big data analytics platform that can help your organization become a digital leader. The collaborative user-interface allows data scientists, data engineers, and business users to work together on data science projects. These cross-functional teams can build machine learning workflows in an intuitive web interface with a minimum of code, while still leveraging the power of big data platforms. Spotfire Data Science provides a complete array of tools (from visual workflows to Python notebooks) for the data scientist to work with data of any magnitude, and it connects natively to most sources of data, including Apache™ Hadoop®, Spark®, Hive®, and relational databases. While providing security and governance, the advanced analytic platform allows the analytics team to share and deploy predictive analytics and machine learning insights with the rest of the organization, white providing security and governance, driving action for the business.
Tags : 
     TIBCO Software APAC
By: Pure Storage     Published Date: Oct 09, 2018
Apache® Spark™ has become a vital technology for development teams looking to leverage an ultrafast in-memory data engine for big data analytics. Spark is a flexible open-source platform, letting developers write applications in Java, Scala, Python or R. With Spark, development teams can accelerate analytics applications by orders of magnitude
Tags : 
     Pure Storage
By: IBM     Published Date: Jul 14, 2016
This video describes how data scientists, analysts and business users can save precious time by using a combination of SPSS and Spark to uncover and act on insights in big data.
Tags : ibm, data, analytics, predictive business, ibm spss, apache spark, coding, data science
     IBM
By: IBM APAC     Published Date: Aug 25, 2017
Machine learning automates the development of analytic models that can learn and make predictions on data. It has been one of the fastest growing disciplines within the world of statistics and data science, but the barrier to entry has been high, not only in cost, but also in the need for specialized talent.
Tags : machine learning, apache spark, additional resources, big data, ibm
     IBM APAC
By: IBM     Published Date: Oct 27, 2016
IBM Analytics for Apache Spark for Bluemix is an open-source cluster computing framework with in-memory processing to speed analytic applications up to 100 times faster compared to other technologies on the market today. Optimized for extremely fast and large scale data processing-you can easily perform big data analysis from one application.
Tags : ibm, apache spark, bluemix, analytics, data science
     IBM
By: IBM     Published Date: Nov 30, 2016
Learn how to create powerful analytic apps with IBM Cloudant, dashDB and Apache Spark. This presentation will contain demos of real-life use cases e.g. machine learning predictive analytics, Graph-parallel computation and more.
Tags : ibm, cloud, analytic apps, apps, cloudant
     IBM
By: IBM     Published Date: Jan 18, 2017
Data matters more than ever to business success. But value does not come from data alone. Rather, it comes from the insights enabled by data. No matter what your role is, or where you are in your data journey, you are looking for ways to drive innovation.
Tags : ibm, analytics, aps data, open data science, data science, apache spark
     IBM
By: BlueData     Published Date: Mar 13, 2018
In a benchmark study, Intel compared the performance of Big Data workloads running on a bare-metal deployment versus running in Docker containers with the BlueData software platform. This landmark benchmark study used unmodified Apache Hadoop* workloads
Tags : big data, big data analytics, hadoop, apache spark, docker
     BlueData
By: MemSQL     Published Date: Nov 15, 2017
FREE O'REILLY EBOOK: BUILDING REAL-TIME DATA PIPELINES Unifying Applications and Analytics with In-Memory Architectures You'll Learn: - How to use Apache Kafka and Spark to build real-time data pipelines - How to use in-memory database management systems for real-time analytics - Top architectures for transitioning from data silos to real-time processing - Steps for getting to real-time operational systems - Considerations for choosing the best deployment option
Tags : hardware trends, data pipelines, database management, architectures, technology
     MemSQL
By: IBM     Published Date: Jul 19, 2016
Watch to learn how an enterprise-grade, multi-tenant solution can help you deploy Spark in a production environment to take advantage of · Faster time-to-results for big data analytics · Simplified deployment and management · Increased utilization of hardware resources"
Tags : ibm, analytics, production environment, apache spark, idc research
     IBM
By: IBM     Published Date: Nov 07, 2016
Apache Spark hit the scene in 2014 and has grown to be the most popular software project in the history of Open Source. Attend this webinar and learn more about; -What is Apache Spark? -Why, is it so popular? -Why is it important to you and your organization? Apache Spark is allowing companies to drive innovative ways to compete using one of the most valuable assets in the 21st century, Data! Apache Spark is the fastest growing framework for powering Big Data Analytics today and for the future. Register to attend this webcast and learn more.
Tags : ibm, big data, platform computing, apache spark
     IBM
By: IBM     Published Date: Apr 07, 2017
Data science platforms are engines for creating machine-learning solutions. Innovation in this market focuses on cloud, Apache Spark, automation, collaboration and artificial-intelligence capabilities. We evaluate 16 vendors to help you make the best choice for your organization. This Magic Quadrant evaluates vendors of data science platforms. These are products that organizations use to build machine-learning solutions themselves, as opposed to outsourcing their creation or buying ready-made solutions.
Tags : data analytics, product refinement, business exploration, advanced prototyping, analytics, data preparation, customer support, sales relations, market research, model management
     IBM
By: IBM     Published Date: Jun 21, 2017
NoSQL databases and Apache Spark are a potent combination for rapid integration, transformation and analysis of all kinds of business data. With its data syncing and analytics capabilities, IBM Cloudant offers unique advantages as a NoSQL database for many Spark use cases. IT decision-makers, data scientists and developers need to know how and when to apply these technologies most effectively. IBM can offer a host of resources and tools to help your organization gain value from Cloudant and Spark quickly, and with minimal up-front investment.
Tags : ibm, ibm cloudant, apache spark, nosql, database
     IBM
By: IBM     Published Date: Jul 07, 2015
Life revolves around prediction—for example, the route you take to get to work, whether to go on a second date, or whether or not to keep reading this sentence are all forms of prediction. We are already seeing machine learning powered by Apache Spark changing the face of innovation at IBM. Learn more.
Tags : intelligent applications, machine learning, prescriptive analytics, real-time, natural language processing, automation
     IBM
By: IBM     Published Date: Oct 26, 2015
Big data is fueling a new economy—one based on insight. How can you create the valuable insights that are the currency for the new economy while controlling complexity? Apache Spark might be the answer.
Tags : ibm, apache spark, big data
     IBM
By: IBM     Published Date: Oct 26, 2015
Machine learning can help us plan our lives so we can increase our likelihood of success. We are already seeing machine learning powered by Apache Spark changing the face of innovation at IBM. Learn more.
Tags : ibm, machine learning, apache spark, business intelligence, intelligence, intelligence applications, big data, data
     IBM
By: IBM     Published Date: Jul 12, 2016
As most companies now realize, analytics is increasingly more of an integral part of their day-to-day business operations. In a recent survey by a global research firm, 80% of CIOs stated that transition from backward-looking, passive analysis must shift to forward-looking predictive analytics. The challenge is that many analytic solutions are aligned to a specific platform, tied to inflexible programming models and requiring vast data movement. In this webcast, Forrester and experts from IBM will highlight how technology like Apache Spark on z/OS allows businesses to extract deep customer insight without the cost, latency and security risks of data movement throughout the enterprise.
Tags : ibm, forrester, apache spark, spark technology, z systems, security
     IBM
By: Datastax     Published Date: Aug 15, 2018
Built on a production-certified version of Apache Spark™ and with integrated search and graph capabilities, DSE Analytics provides highly available, production-ready analytics that enables enterprises to securely build instantly responsive, contextual, always-on applications and generate ad-hoc reports. Read this white paper to learn about the specific features and capabilities of DSE Analytics, and why DSE Analytics is designed for the Right-Now Enterprise.
Tags : 
     Datastax
By: Pure Storage     Published Date: Apr 18, 2018
Apache Spark has become a critical tool for all types of businesses across all industries. It is enabling organizations to leverage the power of analytics to drive innovation and create new business models. The availability of public cloud services, particularly Amazon Web Services, has been an important factor in fueling the growth of Spark. However, IT organizations and Spark users are beginning to run up against limitations in relying on the public cloud—namely control, cost and performance.
Tags : 
     Pure Storage
By: Group M_IBM Q1'18     Published Date: Feb 14, 2018
Data science platforms are engines for creating machine-learning solutions. Innovation in this market focuses on cloud, Apache Spark, automation, collaboration and artificial-intelligence capabilities. We evaluate 16 vendors to help you make the best choice for your organization.
Tags : gartner, magic quadrant, data science platform, machine-learning
     Group M_IBM Q1'18
Search White Papers      

Add White Papers

Get your white papers featured in the insideBIGDATA White Paper Library contact: Kevin@insideHPC.com