06. More on dplyr and reshaping data
This week …
More on dplyr: using across() to apply functions to multiple columns at once; using case_when() to create new variables based on conditions; more on reshaping data with pivot_longer(); more on filtering and selecting data.
Required Reading
- Karl W. Broman and Kara H. Woo “Data Organization in Spreadsheets,” The American Statistician 72, no. 1 (January 2, 2018): 2–10, doi:10.1080/00031305.2017.1375989.
tidyr
- Tidy data
- Pivoting
- (Optional.) Rectangling
stringr
- Regular expressions
- Read the help page for
str_detect():?stringr::str_detect, which can also be found here. Work through the examples one at a time and make a note of any that seem confusing.