George Hou


Data Science Engineer

Data Engineering with the Million Song Dataset

Using AWS to perform ETL

Posted on April 30, 2020

The purpose of this project is to understand what, where, and how each user listens to songs, using metadata generated from the Million Song Dataset. The analytical goal is to find out what makes free-tier users switch to the paid tier and why... [Read More]
Tags: SQL ETL AWS spark redshift

Data Modeling with Cassandra

Query data with partition key

Posted on March 6, 2020

Using Python to create an ETL pipeline for data modeling with Apache Cassandra. [Read More]
Tags: SQL NoSQL Data Modeling Jupyter Cassandra

Consumer Complaints Data Transformation

Using a Python script to transform data

Posted on February 23, 2020

For this project, using only built-in Python libraries, we want to know, for each financial product and year, the total number of complaints, the number of companies receiving a complaint, the company with the most complaints, and the highest percentage of complaints directed at a single company. [Read More]
Tags: ETL Government Public Data Python Script Linux
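
A minimal sketch of the aggregation described in the post above, using only the standard library. The column names (`product`, `date`, `company`), the sample rows, and the `summarize_complaints` helper are assumptions made up for illustration; they are not the project's actual code or data.

```python
import csv
import io
from collections import Counter, defaultdict


def summarize_complaints(rows):
    """Aggregate complaint rows per (product, year).

    `rows` is an iterable of dicts with the hypothetical keys
    'product', 'date' (YYYY-MM-DD) and 'company'.
    """
    # (product, year) -> Counter of complaints per company
    per_group = defaultdict(Counter)
    for row in rows:
        product = row["product"].strip().lower()
        year = int(row["date"][:4])
        company = row["company"].strip().lower()
        per_group[(product, year)][company] += 1

    summary = []
    for (product, year), counts in sorted(per_group.items()):
        total = sum(counts.values())
        # Most-complained-about company; ties broken alphabetically.
        top_company, top_count = min(
            counts.items(), key=lambda kv: (-kv[1], kv[0])
        )
        summary.append({
            "product": product,
            "year": year,
            "total": total,
            "companies": len(counts),
            "top_company": top_company,
            "top_share_pct": round(100 * top_count / total),
        })
    return summary


# Made-up sample data, not real complaint records.
SAMPLE_CSV = """product,date,company
Credit card,2019-01-05,Acme Bank
Credit card,2019-03-17,Acme Bank
Credit card,2019-07-02,Birch Credit
Mortgage,2020-02-11,Birch Credit
"""

summary = summarize_complaints(csv.DictReader(io.StringIO(SAMPLE_CSV)))
for line in summary:
    print(line)
```

Grouping with a `Counter` per (product, year) key means every requested figure can be read off a single pass over the data: the sum of the counts is the total, the number of keys is the number of companies, and the largest count gives both the top company and its share.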

Exploratory Analysis of Apple Mobile App Reviews

From extracting data with the iTunes API to data visualization

Posted on January 19, 2020

Tags: API data visualization mobile apple jupyter notebook

Analyzing Yelp Dataset with Scattertext

Exploratory data analysis and visualization for text data using NLP

Posted on November 18, 2019

One of the most crucial tasks in text mining is to present the content of text data visually. Using natural language processing (NLP), a data scientist can summarize documents, create topics, and explore the storylines of the content from different angles and at different levels of detail. [Read More]
Tags: big data visualization yelp business jupyter notebook

George Hou  •  2021  •  georgehou2008@gmail.com
