


Jeremy Leipzig is a bioinformatics software developer at DuPont Crop Genetics. He has conducted academic research in viral integration, metagenomics, schizophrenia, and alternative splicing.
Data Mashups in R
A Case Study in Real-World Data Analysis
Summary
Data analysis is more than means and standard deviations. This book is a case study of how you can push R into new territory to analyze online real-world data. The authors scrape public foreclosure records for Philadelphia, geocode them, plot them by neighborhood, and analyze the results, using R facilities to interact with web servers, parse HTML and XML, and more.
- Parse messy data from public foreclosure auction postings
- Plot the data using R's PBSmapping package
- Import US Census data to add context to foreclosure data
- Use R's lattice and latticeExtra packages for data visualization
- Create multidimensional correlation graphs with the pairs() scatterplot matrix function
Specifications
About Xiao Yi Li
Table of Contents
- Messy Address Parsing
- Shaking the XML Tree
- The Many Ways to Philly (Latitude)
- Exceptional Circumstances
- Taking Shape
- Developing the Plot
- Turning Up the Heat
2. Statistics of Foreclosure
- Importing Census Data
- Descriptive Statistics
- Descriptive Plots
- Correlation
- Final Thoughts
Appendix: Getting Started