My 2020 Tale

Photo by Jeff Cooper on Unsplash
First 5 Rows of Data Set for Project #4
  • First, using the provided data set, I calculated the return on investment (ROI) of each zip code in the US from the time period following the 2008 market crash until mid-2018. The zip codes were then ranked according to the ROI in descending order.
  • The second step was to conduct market research to identify the real estate markets forecasted to have high growth potential. Using the ROI data and market research, I choose to concentrate on the top 11 zip codes in the San Jose, Phoenix, and Las Vegas markets for modeling in step 3.
  • The third and final step used statistical analysis on the data to identify trends and then model the data with S/ARIMA to forecast for potential growth in 2019 and 2020.
  • Mountain View, CA (94043) with 60%
  • Sunnyvale, CA (98089) with 54%
  • Palo Alto, CA (94301) with 33%
  • Palo Alto, CA (94302) with 33%
  • Las Vegas, NV (89104) with 31%
  • Las Vegas, NV (89107) with 30%
Heart Disease Data Set from Kaggle
Number of Patients With and Without CVHD at Age
  • Logistic Regression
  • Random Forest
  • K-Nearest Neighbor (KNN)
  • Support Vector Machine (SVM)
Web Scaping Coffee Review URLs
Pre-Clean Scraped Coffee Data
Using Regex to Extract the Two Numbers from ‘Agtron’ Column
Two New Columns with the Agtron Whole and Agtron Ground
Country Dictionary
Add Number of Growers/Country to World Data
New Data Set with ‘number of growers’
GeoPandas Code
Result of Coffee Growers/Country
  • Logistic Regression
  • Random Forest
  • Random Forest with SMOTE
  • Random Forest with GridSearch
  • XGBoost
1. aroma
2. flavor
3. body
4. aftertaste
5. acidity
6. the price per ounce
9. Kenya
My Github Contributions
Photo by Conscious Design on Unsplash
My Peloton Data from a CSV file
September 19, 2020 Challenge Ride
Use of del Keyword
Use of drop() Function
  1. Continue to be the gatekeeper to my happiness
  2. Find a meaningful job in Data
  3. Get vaccinated!
Photo by Kym MacKinnon on Unsplash




Love podcasts or audiobooks? Learn on the go with our new app.

Recommended from Medium

Unveil DataScience/AI/ML Myths

Scalable methods for explaining machine learning

Data Viz Thoughts .|: Joshua Smith Interview & Soha Elghany has our #VOTW

Why You’re Not Data Driven — Part 2

Get Healthy with Capability Maturity Modeling

EC2 Instances FinOps and DataAnalysis

Explained: Predictive Analytics — Airbnb

Data science ethics book club #4 by DataKind UK

Get the Medium app

A button that says 'Download on the App Store', and if clicked it will lead you to the iOS App store
A button that says 'Get it on, Google Play', and if clicked it will lead you to the Google Play store
Annika Noren

Annika Noren

More from Medium

My Programming Story: Nine years ago, at the end of 2012, I decided to teach myself programming.

How is Coding Useful for Kids in Their Careers?

online coding classes

Stop trying to bend the fabrics of reality.

3 FUN WAYS TO LEARN PROGRAMMING (and not die trying)