IBM launched a development environment for Apache Spark, which it’s delivering in the cloud for rapid adoption.

The company says it’s the first cloud-based development environment for near real-time, high-performance analytics, giving data scientists the ability to access and ingest data and deliver models to developers.

Available on the IBM Cloud Bluemix platform, the Data Science Experience provides 250 curated data sets, open source tools and a collaborative workspace designed to help data scientists uncover and share meaningful insights with developers. That, in turn, should make it easier to develop applications quickly.

Building on its $300 million investment in developing Apache Spark as a type of “analytics operating system,” IBM says it created the Data Science Experience to extend the speed and agility of Spark to more than two million members of the R community through new contributions to SparkR, SparkSQL and Apache SparkML.

As a result, data scientists who work in R will have faster access to more data and more insights delivered from the IBM Cloud.

The Data Science Experience’s open and collaborative environment will enable data scientists to accelerate and simplify data ingestion, curation and analysis by bringing together the content, data, models and open source resources from IBM and others, the company says.

“With Apache Spark, we see an opportunity to significantly transform the role of the data scientist by providing access to curated data sets, open source tools and a collaborative platform to accelerate innovation,” said Bob Picciano, senior vice president, IBM Analytics.
