About me:

Hello, welcome to my portfilo.

I'm Fiona, a junior data scientist who is a recent graduate in the field of Data Science. I have a bachelor's degree in Information Technology, which is what started me on the road to the data science field. I found my passion in data science soon after I finished my degree and thus decided to take the next step and enroll in a few courses. My most recent course was at LearningFuze, where they specialize in teaching data science and web development. There, I was able to gain the knowledge and hands on skills that I need to further my career in the data science field. At the moment I am working on completing the data science certificate with DataCamp.

A few of my skills:

  • Python, SQL, Spark
  • Machine Learning: Supervised and unsupervised ML models
  • Neural networking, Regression and Classification
  • Data Preparation, Cleaning, Visualization
  • Modeling, Debugging, Problem Solving
  • Kaggle competitions

Projects:

Here are a few of my most recent projects. I upload both completed projects and projects that I will be completing soon, so make sure to come back to see what I have completed. If you have any questions on any of my projects listed here then don't be afraid to contact me and ask.

House Prices: Advanced Regression Techniques (Completed)

This was a competition project from Kaggle. The goal of this project was to be able to predict the sale price of the houses by using feature engineering and working with advanced regression techniques. At the time of submission I was in the top 12%.

WiDS Image Recognition

In progress...

This is an image recognition project where I am working with neural networks. This project is about creating a model that will be able to predict oil palm plantations from satellite images.

Home Credit Default Risk (Completed)

A kaggle competition where you predict how capable each applicant is for repaying a loan. An import part of this project was how I managed the data imbalance, I handeled it by using a Pipeline and SMOTE. I also used LGBM Classifier for the model.

Store Sales - Time Series Forecasting

In progress...

This time series project is to be able to predict the unit sales for thousands of items sold at different stores.

Spaceship Titanic (Completed)

Instead of the regular Titanic dataset that everyone knows, this dataset is about the future titanic set in Space where instead of the passengers surviving or not they are “Transported “ into another dimension. At the time of submission I was in the top 24%

JPX Tokyo Stock Exchange Prediction

In progress...

This stock exchange competition is what I am currently working on. The goal of the project will compare my model against real future returns after the training phase is complete.

My CV: