This is a companion glossary for a previous post on working with large data sets. Its purpose is to highlight the relevant arguments for dealing with reading and working with large data sets. Continue reading “Glossary for Working with Data sets”
Author: Kim Channell
Working with Large Data Sets
This tutorial provides a walk-through of managing a large data set in R. The sample data set used is on Precipitation in the Great Lakes Region retrieved from GLERL. It is a multi-tab excel file that needs to be cleaned up in R before it can be used efficiently. General methods of dealing with large datasets and the problems one can run into are included so that information in this tutorial can be applied to various types of data.