June 19, 2010 – Talend released Hadoop-based data management software today as part of its Talend Integration Suite.

The Talend Integration Suite is the first enterprise open source data integration solution to natively support Hadoop’s distributed computing architecture, according to the company release.

Hadoop is an open source platform for scalable, distributed data storage and processing. With Hadoop integrated into Talend’s Integration Suite, users are able to access, manipulate and store large volumes of data, says Yves de Montcheuil, vice president of marketing for Talend.

Talend’s Integration Suite now offers native support for the Hadoop Distributed File System (HDFS), which provides access to Hadoop data, and for Hive, the database infrastructure built on top of Hadoop, for structured and complex data processing, according to the company release. The suite leverages Hadoop’s MapReduce architecture for highly distributed data processing. Its Hadoop components generate native Hadoop code and run data transformations directly inside the Hive database for maximum scalability.
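
For readers unfamiliar with the MapReduce model mentioned above, the following is a minimal single-process sketch in Python. It is not Talend-generated code and does not use the Hadoop API; the function names and the sample transaction records are hypothetical, chosen only to show the map, shuffle and reduce steps that Hadoop distributes across a cluster.

```python
from collections import defaultdict
from itertools import chain


def map_phase(record):
    """Emit (key, value) pairs for one input record (hypothetical schema)."""
    card_type, amount = record
    yield card_type, amount


def shuffle(pairs):
    """Group all emitted values by key, as the framework does between phases."""
    grouped = defaultdict(list)
    for key, value in pairs:
        grouped[key].append(value)
    return grouped


def reduce_phase(key, values):
    """Combine all values for one key into a single result."""
    return key, sum(values)


def run_job(splits):
    """Run map over every record in every split, then shuffle and reduce."""
    mapped = chain.from_iterable(
        map_phase(record) for split in splits for record in split
    )
    grouped = shuffle(mapped)
    return dict(reduce_phase(key, values) for key, values in grouped.items())


# Two input splits stand in for data blocks stored on different nodes.
splits = [
    [("credit", 20), ("debit", 5)],
    [("credit", 30), ("debit", 15), ("prepaid", 10)],
]
totals = run_job(splits)
print(totals)
```

In a real Hadoop deployment each split would be mapped on the node that holds it, and the grouped keys would be reduced in parallel; this sketch runs the same steps sequentially in one process.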

Demand is high for processing large amounts of data and incorporating analytics, says de Montcheuil. “Hadoop has probably the most mature architecture to do that type of processing.”

“As is true with many bleeding edge technologies, Hadoop is a relatively new concept, but it’s based on MapReduce that’s been around for quite some time and deployed by a number of large enterprises,” says de Montcheuil. “Right now, we thought that it was mature and powerful enough to invest in and we are seeing very strong adoption.”

That adoption is part of the fuel behind the cloud computing fire, according to Steve Miller, co-founder of business intelligence services firm OpenBI, LLC. “Cloud computing and open source software are fueling the data and analytics binge,” he wrote in a blog post this spring. “With the Hadoop platform, Visa recently mined two years of data, over 73 billion transactions amounting to 36 terabytes. The processing time dropped from one month to 13 minutes.”
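
To put the Visa figures quoted above in perspective, here is a rough back-of-the-envelope calculation in Python. It assumes a 30-day month and decimal terabytes (10^12 bytes), neither of which is specified in the quote, so the results are approximations only.

```python
# Assumptions (not stated in the quote): a 30-day month, decimal terabytes.
MINUTES_PER_MONTH = 30 * 24 * 60
BYTES_PER_TERABYTE = 10**12


def summarize(transactions, total_bytes, before_minutes, after_minutes):
    """Derive speedup and throughput figures from the quoted numbers."""
    return {
        "speedup": before_minutes / after_minutes,
        "bytes_per_transaction": total_bytes / transactions,
        "transactions_per_second": transactions / (after_minutes * 60),
    }


stats = summarize(
    transactions=73_000_000_000,
    total_bytes=36 * BYTES_PER_TERABYTE,
    before_minutes=MINUTES_PER_MONTH,
    after_minutes=13,
)
for name, value in stats.items():
    print(f"{name}: {value:,.0f}")
```

Under those assumptions the job ran roughly 3,300 times faster, at an average of a little under 500 bytes per transaction.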

Click here to read Jim Kobielus’ take on Hadoop and data mining, “Mining the Cloud? Are You Allowed?”
