Statistics with R on Large Data Sets
San Diego Supercomputer Center - Conference Room E-145
What are the practical challenges of "doing statistics" on large data sets, and what tools exist in R to help meet them? In this talk, I will offer some ideas about how to think about "Big Data" from a statistical point of view, suggest computer architectures that facilitate working with R, and show some examples of R code working on large data sets.
Joseph is a Data Scientist and Community Manager at Revolution Analytics with a passion for analyzing data and teaching people about R. He is a regular contributor to the Revolutions blog and an organizer of the Bay Area R Users Group. Joseph is a long-time Silicon Valley start-up guy with experience building statistical models in industries as diverse as local area networks and healthcare. He holds graduate degrees in both the Humanities and Statistics, and he taught statistics briefly at SJSU.