Posts

Torch for R on WSL2

This past spring my laptop of 5 years died and had to be replaced. Unlike the more forward thinking experts out there, I failed to hang on to most any installation scripts, so it’s taken some time to re-build my local work environment to my liking. I’ve made a commitment to organizing much more on the Windows Subsystem for Linux (Ubuntu 22.04 jammy jellyfish) instead of Windows this time for speed and code interoperability.

R Resources

My short list of useful books, courses, and expert blogs. Many are free. These could be departure points for your own learning journey. My hope is that you take and copy sections for sharing with your own community of practice. Yes, I am giving away all of the secrets. One of the things that I spend a lot of time thinking about is how to communicate about data science with other human beings.

mapboxapi

Mapbox web services APIs for #rspatial data science projects

Generations in America

The composition of American generations per Pew Research & the US Census

Animating 2021 Cycling Goals with rStrava

Cycling 🚲 popularity has grown in 2021 as evidenced by local cycling club activity and US national surveys. Some of this may be related to changes like new e-bike configurations and city-wide bike share systems that make it accessible to more people. It’s also possible that post-COVID, many people still want fun & affordable exercise. Bike shops have been busy filling orders and making repairs. Lead times for some models are months long as global supply chains work to catch up.

Exploratory Data Analysis

More often than not, the most impactful analytics projects do not involve any artificial intelligence. The real financial return comes from insights delivered to the organization via Exploratory Data Analysis, or EDA. Analytics professionals who focus on decision science, that is, people that use data to provide ground truth to the business, must be able to articulate what the EDA process is and what the standards are for the process. Analysts should be the best people available to find meaning in the data.

rtweet

I was inspired by David Robinson’s Tidy Tuesday screencast to work with rtweet on familiar Twitter handles. Writing on rtweet is an amusing bit of productive procrastination while honing my R skills. The rtweet package creator and owner, Mike Kearney, has hosted data journalism workshops covering a wide range of uses for this kind of data. They include text sentiment, geospatial networks, time series, and even bot detection. Oscar Baruffa and Veerle van Son have written a bookdown pamphlet called Twitter for R Programmers as a starter guide for people new to the R community on Twitter.

Digital Signal Processing

Signals are everywhere around the world today. Audio communications are more critical than ever. Airplanes use signals in the air to obtain important information to ensure safety on a flight. Cell phones are pretty much a single small high performance digital signal processing device. They process our speech when we talk by removing background noise and echos that would distort the clarity. They obtain wifi signals to allow for web searches.

Ten Tidyverse Updates

Stay up to date with examples using the penguins dataset

Practical Machine Learning in R

First, a disclosure. I know the authors of Practical Machine Learning in R, as both professors Fred Nwanganga and Mike Chapple teach courses in the Notre Dame Mendoza MSBA program. It is a rare pleasure to be able to read a book and to hear the actual author’s voice in the words. After finishing the MSBA program, I felt compelled to take this book up and explore the topics again.