JAN 23, 2014 1:00pm ET

Related Links

Ellison Becomes Oracle Chairman as Catz, Hurd Split CEO Job
September 18, 2014
Stop Governing Your Data - Start Leading Data Behaviors and Outcomes
September 18, 2014
Big Data Gets Bigger Footprint in Insurance
September 16, 2014

Web Seminars

Essential Guide to Using Data Virtualization for Big Data Analytics
September 24, 2014
Integrating Relational Database Data with NoSQL Database Data
October 23, 2014

Dueling R and Python, Contest 1A


I decided to expedite my promise from a blog 2 weeks ago of “porting several soup-to-nuts R examples I’ve evolved over the years to a comparable 2014 Python environment”, to show the emerging similarity of Python and R platforms for data analysis/statistics. There’s more than a little motivation in the fear of being late to the game.

Get access to this article and thousands more...

All Information Management articles are archived after 7 days. REGISTER NOW for unlimited access to all recently archived articles, as well as thousands of searchable stories. Registered Members also gain access to:

  • Full access to information-management.com including all searchable archived content
  • Exclusive E-Newsletters delivering the latest headlines to your inbox
  • Access to White Papers, Web Seminars, and Blog Discussions
  • Discounts to upcoming conferences & events
  • Uninterrupted access to all sponsored content, and MORE!

Already Registered?

Filed under:


Comments (3)
This compares a package in Python (Pandas) to base R. Very disappointed you haven't mentioned the data.table package at all or its fread function which is much faster and easier to use in the case you've gone to some lengths here to highlight.

Why not mention data.table or fread?

Posted by Matthew D | Friday, January 24 2014 at 9:31AM ET
Matthew -- thanks for your response.

I'm a big data.table fan and have written on it in several blogs. I have not worked with fread but will take a look. As you we'll know, a big challenge for R analysts is to stay up on the 4000+ packages that are not part of R core.

A reader of the column 2 weeks ago took me to task for being too pro-R! The reason that column was popular is that there's lots of interest in both R and Python for analytics.

Why don't you do the exercise using data.table and fread and send me the code? I'll run it on my computer and report the results to IM readers."


Posted by steve m | Friday, January 24 2014 at 10:06AM ET
Add Your Comments:
You must be registered to post a comment.
Not Registered?
You must be registered to post a comment. Click here to register.
Already registered? Log in here
Please note you must now log in with your email address and password.
Login  |  My Account  |  White Papers  |  Web Seminars  |  Events |  Newsletters |  eBooks
Please note you must now log in with your email address and password.