Celine Wang


Data scientist holding multiple certificates in data analysis and machine learning, with a background in chemical research and education. Proficient in Mandarin and Japanese. Detail-oriented, embraces challenges, and eager to learn new skills.

View My LinkedIn Profile
View My GitHub Profile
View My Tableau Public

Portfolio


Data Management

Skills

PostgreSQL, Azure SQL, MySQL, APIs, NoSQL (MongoDB), SQLite, ETL, Beautiful Soup, Spark, Big Data, Hyperion

Projects

  1. Data Queries with Data Engineering: Designed and implemented advanced SQL queries and subqueries with window functions to collect student demographics and program success metrics.
  2. Movies ETL Pipeline: Extract and transform movie data from multiple sources, including Wikipedia, Kaggle metadata, and a local file. The well-structured dataset (26M records) was loaded into SQL for further queries.
  3. Amazon Vine Review Big Data: Analyze Amazon Office Products reviews via AWS RDS to determine whether reviews left through Amazon Vine show any bias.
  4. Plan My Trip APIs: Recommend hotels in the target city and travel routes based on the customer’s weather preferences.
  5. Mars Web Scraping: Consolidate Mars information from multiple websites into a customized Flask app.


Machine Learning

Skills

Scikit-learn, TensorFlow, NumPy, Pandas, Seaborn, VS Code, Jupyter Notebook, Google Colab

Projects

  1. FoodMart Media Campaign: Predict store sales and media cost with multiple linear regression. Classify customer membership with a deep neural network.
  2. MercadoLibre_Financial_Forecast: Use Google search data, company stock price, and revenue data to generate insights and forecasts for better planning.
  3. Credit Risk Prediction: Predict customer credit risk with logistic regression and classifier models using resampling methods. Compare model performance metrics before and after scaling.
  4. Cryptocurrencies Clustering: Cluster cryptocurrencies using KMeans clustering, with the number of clusters guided by the elbow curve.
  5. Charity Foundation: Predict funding success with a neural network and optimize the model by re-preprocessing the data and fine-tuning hyperparameters.


Data Visualization

Skills

Flask, Matplotlib, Tableau, HTML, CSS, JavaScript, Plotly, Leaflet

Projects

  1. Project Log: Analyzed team organization and hour tracking with Power BI, including time lapse, search bar, keyword lists, and hierarchical filters.
  2. SuperStore Analysis: Show patterns and trends in sales, products, and customers in Tableau.
  3. Funding Startups: Target startup companies using thresholds for expenses, revenue, and top growth.
  4. Biodiversity Plotly Dashboard: Create an interactive dashboard to display the bacteria cultures of each test subject.
  5. Mapping Earthquake: Visualize earthquakes around the world over the past 7 days and their relation to tectonic plates.


Data Mining and Statistical Analysis

Skills

Python, Pandas, NumPy, R, MS Excel, SPSS

Projects

  1. Employee Attrition Prediction: Discover the relationship between duration of employment and employee demographics. Based on the results, predict employee attrition with multiple ML models.
  2. Stock Analysis Excel VBA: Create a stock index and calculate total daily volume and returns.
  3. MechaCar R Statistical Analysis: Determine the features that most significantly impact MPG with multiple linear regression. Assess product quality from summary statistics and t-tests.
  4. Chemistry Teaching Efficacy Analysis: Developed and analyzed surveys on high school chemistry teaching efficacy using SPSS. Conducted exploratory and statistical analyses (t-tests, ANOVA, factor analysis) and incorporated interviews and observations to propose improvements from a psychological perspective.


Programming

Skills

C++, Python, Java, JavaScript (HTML, CSS, Flask), Data Structures, Algorithms, Pointers, R, Assembly language

Projects

  1. Banking: Create a superclass Account and subclasses Checking and Saving so that customers can open checking and savings accounts, each with a unique ID, and carry out transactions independently and interactively.
  2. Election Analysis: Calculate the number of votes and vote turnout for each candidate to determine the results of a local congressional election.

Cloud Computing

Skills

AWS, Google Cloud Platform, Terraform, Docker, Kubernetes

Projects

  1. Covid Testing MultiCloud: Deploy a hotel customer COVID testing system as a Docker container application running on Google Cloud, while storing customers’ private data on AWS.


Work Experience

Research Assistant III
San Diego College of Continuing Education | 01/2024 - present
Instructional Specialist (Learning Assistant)
Data Analytics and Visualization in 2U(edX) | 03/2023 - 01/2024
Mandarin Chinese Teacher
Tri-Cities Chinese Language School | 01/2019 - 12/2019
R&D Chemist and Technical Consultant
Nikken Chemical Laboratory Co., Ltd., Japan | 08/2015 - 10/2019
Research Associate
Electrochemical Energy Lab, Mie University, Japan | 04/2011 - 03/2014

Education

Computer Science and Machine Learning
Community College and Coursera | 11/2022 - present
Data Science and Visualization Boot Camp
University of California San Diego | 05/2022 - 11/2022
M.Ed. in Curriculum and Teaching Methodology (Chemistry)
Nanjing Normal University (China) | 09/2007 - 07/2010
B.S. in Chemistry
Anhui Normal University | 09/2003 - 07/2007

Certificates

😄Thanks for reading!😄