Skip to content
View lucas-nelson-uiuc's full-sized avatar
🎧
jamming
🎧
jamming

Block or report lucas-nelson-uiuc

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
lucas-nelson-uiuc/README.md

Hi, I'm Lucas Nelson

About Me

I'm a data analyst/engineer with a strong foundation in statistics, currently transitioning into analytics engineering.

I enjoy designing tools that make working with data more intuitive, efficient, and reproducible. Whether it's optimizing pipelines, automating tedious processes, or building scalable validation frameworks, I aim to bridge the gap between data engineering and analytics.

Technologies & Tools

  • Big Data: PySpark, Polars, DuckDB
  • Python: Pydantic, Dagster, DLT, Narwhals
  • Automation & DevOps: Pre-commit, Ruff, MkDocs-Material
  • Data Validation: Custom-built PySpark validation tools inspired by tidylog & Pydantic
  • Web Apps: Shiny for interactive analytics dashboards

Featured Projects

A declarative schema validation and transformation framework for PySpark, Polars, and DuckDB, designed with Narwhals to bring structure and clarity to data workflows.

An on-going effort to track and analyze my bowling scores.

A deep dive into Spotify’s API to analyze music trends, listening habits, and playlist dynamics.

Teaching & Knowledge Sharing

I have a strong background in statistics and have taught courses at UIUC, covering topics in Python, data science, and big data. Some highlights:

  • CS 105 (Intro to Python) – Co-led weekly lectures on Python applications in statistics and CS.
  • STAT 430 (Data Science in Python) – Built an automated grading bot for GitHub submissions.
  • STAT 480 (Big Data Fundamentals) – Designed a testing suite for lab assignments.
  • I also host weekly training sessions at work, covering Python fundamentals and best practices in analytics engineering.

Get in Touch

Always open to discussing data engineering, analytics, and workflow optimization!

Pinned Loading

  1. podium podium Public

    DataFrame expressions constructors using Narwhals.

    Python

  2. narwhals-dev/narwhals narwhals-dev/narwhals Public

    Lightweight and extensible compatibility layer between dataframe libraries!

    Python 855 131

  3. pinsdb pinsdb Public

    WIP database for tracking and storing bowling games.

    Jupyter Notebook

  4. python-training python-training Public

    Articles and exercises to further a programmer's Python skillset.

    Python