Welcome!

Hi everyone and welcome to the super-fun R workshop on data mainpulation in R. Have you started to use R but have trouble getting your dataset into the right format? Do dates and times confuse you? Do you “fix” your data in Excel before uploading into R? Then this is the class for you!

Required software:

Required packages:

packages <- installed.packages()[,"Package"]
if(!is.element("tidyverse",packages)) install.packages("tidyverse")
if(!is.element("readxl",packages)) install.packages("readxl")
if(!is.element("hms",packages)) install.packages("hms")
if(!is.element("lubridate",packages)) install.packages("lubridate")

In this class, you will be learning how to download data, restructure it, clean it up, and merge different types of data together. We have broken it into modules. We may not get through them all today, but you can walk though them on your own afterwards.

Data:

We will be using the California Department of Fish and Wildlife Sacramento-San Joaquin Zooplankton data for this course. You may want to download the data ahead of time in case you have wi-fi issues. Data are available at the links below.

Other resources

  • Join the Data Science PWT for more help on these topics! Sign up here or visit us on GitHub.
  • rseek is a search engine that just searches R-related pages
  • R for Data Science is a free webbook with all kinds of useful resources for data manipulation and visualization.
  • Cookbook for R is a great site walking you through the basics.
  • Stack Overflow is the site for questions and answers on all types of coding topics.
  • Software Carpentry teaches a variety of data science workshops, and makes their lessons available online.

You can do it!

Remember, the difference between being good at coding and being bad at coding it being OK with copying and pasting from Google.