gguillau/README.md

👋🏾 Welcome, I'm Giovanni

I'm a data enthusiast from Brooklyn, New York, with a background in psychology and research. I'm motivated and analytical, proficient in Python, Excel, Tableau, and SQL, and have a strong foundation in data analysis, data visualization, machine learning, and statistical modeling.

Gmail Badge Portfolio Badge

  • 🔭 Currently adding projects to my repo and searching for a position in data.
  • 📈 Committed to continuously expanding my knowledge in the evolving field of data science.
  • 👨🏿‍💻 My projects are available in my Data Science & Analytics Portfolio.
  • 💬 Ask me about data science, music, film & TV/anime, soccer, or basketball!

Technical Projects

Here are some projects that I'm particularly proud of (WIP = Work in Progress):

The 2024-2025 NBA season is just starting, and you have just landed a job as a data scientist for your favorite NBA team. With changes in play style, officiating, and general strategy, professional basketball looks very different than it did when the NBA first started. This project aims to identify player “archetypes”, which you can think of as types of roles that are not concretely defined.


The task is to train a machine learning model that automatically generates answers to written questions entered by a user. The model is trained on questions and answers from the Python Questions from Stack Overflow dataset.


Yachay is an open-source machine learning community that has collected decades' worth of natural language data from various sources. I contributed research to its infrastructure, with the goal of training a BERT-based deep learning model to predict user geolocation from individual tweets.


Tasked with developing a Python-based regression model to predict the valence of pop songs. Valence describes the musical positiveness of a track, ranging from sad/depressed to happy/cheerful. An automatic method of estimating valence is useful for playlist curation and other applications.
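As a rough illustration of what a valence regression can look like (this is not the project's actual model), here is a minimal sketch that fits a one-feature least-squares line. It assumes valence is scored on a 0.0 to 1.0 scale and uses a single hypothetical `energy` feature. The toy numbers are made up for the example and are not project data.

```python
# Minimal sketch: one-feature ordinary least squares for valence.
# Assumptions (not from the project): valence lies in [0.0, 1.0],
# "energy" is a hypothetical audio feature, and the data below is invented.

def fit_line(xs, ys):
    """Return (slope, intercept) of the least-squares line through the points."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict_valence(slope, intercept, energy):
    """Predict valence for one track, clamped to the assumed [0.0, 1.0] range."""
    raw = slope * energy + intercept
    return min(1.0, max(0.0, raw))


# Invented toy data: higher energy loosely tracks higher valence.
toy_energy = [0.2, 0.4, 0.6, 0.8]
toy_valence = [0.3, 0.4, 0.5, 0.6]

slope, intercept = fit_line(toy_energy, toy_valence)
new_track_valence = predict_valence(slope, intercept, 0.5)
```

A real model would use many more tracks and features, plus a held-out set for evaluation. The clamp is there only because a linear fit can predict values outside the assumed range.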

Programming Languages and Tools

python postgresql mysql mssql pandas html5 scikit_learn tensorflow pytorch seaborn linux

   

Pinned

  1. Data-Science-Portfolio

    Repository containing portfolio of data analysis and machine learning projects

    Jupyter Notebook 1

  2. Yachay.ai-Tweet-Geolocation-Prediction

    This project aims to improve Yachay.ai's infrastructure for training a deep learning model that predicts the coordinates of individual texts.

    Jupyter Notebook 5

  3. Cuetessa-Song-Valence-Prediction

    Develop a Python-based module for a startup to predict the valence of newly released pop songs

    Jupyter Notebook 2