I decided to expedite my promise from a blog 2 weeks ago of “porting several soup-to-nuts R examples I’ve evolved over the years to a comparable 2014 Python environment”, to show the emerging similarity of Python and R platforms for data analysis/statistics. There’s more than a little motivation in the fear of being late to the game.

My first example draws on a medium-sized data set from U.S. Census Bureau’s Annual American Community Survey sample of households and individuals that provides a wealth of information on population demographics, income, education, residence, family characteristics, etc. over time. I’ve worked with versions of Public Use Microdata Sample (PUMS) for over 5 years, finding the data highly-informative and a more-than-toy technical test as well.

Register or login for access to this item and much more

All Information Management content is archived after seven days.

Community members receive:
  • All recent and archived articles
  • Conference offers and updates
  • A full menu of enewsletter options
  • Web seminars, white papers, ebooks

Don't have an account? Register for Free Unlimited Access