When it comes to big data and cloud computing, it looks like more and more pipelines will funnel into so-called data lakes. Generally speaking, a data lake is a storage repository that holds lots of data in its native format until it's needed. But what problems do they solve -- and what new challenges might they introduce for data scientists and business analysts?

First, the good news. Data lakes hope to solve two problems -- one old and one new, according to Gartner:

Register or login for access to this item and much more

All Information Management content is archived after seven days.

Community members receive:
  • All recent and archived articles
  • Conference offers and updates
  • A full menu of enewsletter options
  • Web seminars, white papers, ebooks

Don't have an account? Register for Free Unlimited Access