×
Ronnagon Phukahuta

Ronnagon Phukahuta

Data Engineer | Data Platform & Big Data Architecture

Chiang Mai, Thailand, TH
+66 95-414 5964

Background


About

About

Data Engineer with 2+ years of experience building production-grade data platforms from the ground up. Proven track record in large-scale refactoring, CI/CD pipeline design, and Hadoop/Spark data lake architecture in a real organizational environment. Sole engineer across full-stack data systems — from raw ingestion to NLP-powered analytics. Actively expanding toward cloud-native architecture and multi-cloud deployment.

Work Experience

Work Experience

  • Data Engineer, Information Technology Service Center (ITSC), Chiang Mai University

    Jan, 2024 - Present

    • Spearheaded a major architectural refactor across 96 files to migrate legacy script-based systems to standardized Python packaging (PEP 517/621).

    • Eliminated 30+ 'sys.path' technical debt locations, achieving 100% environment portability and reducing system complexity.

    • Designed and optimized Hadoop/Spark pipelines and Airflow DAGs for high-reliability production environments.

    • Implemented a robust CI/CD suite with 315+ automated tests using GitHub Actions, ensuring zero-downtime deployments.

Projects Experience

Projects Experience

  • Finance Data Lake Platform (CMU ITSC)

    - Present

    Designed and built a production data lake platform from scratch for Chiang Mai University's financial data, serving as the sole engineer across the full stack — ingestion, transformation, schema management, monitoring, and NLP-powered analytics dashboard.

    • Architected end-to-end ETL pipelines using PySpark, Hive, and HDFS handling real organizational financial data.

    • Built a schema evolution and version management system with automated approval workflows and audit logging.

    • Developed an NLP-powered analytics dashboard (Streamlit + OpenAI) enabling natural language queries over financial datasets.

    • Implemented automated data quality validation rules and statistical trend monitoring across all datasets.

    • Established full CI/CD pipeline (GitHub Actions, 7 jobs) with 315+ unit and integration tests — zero-downtime deployment standard.

  • Real-time Stock Tracker (Solo Portfolio Project)

    - Present

    End-to-end financial data platform built solo, demonstrating full ownership from data ingestion to interactive dashboard — a rare combination at this experience level.

    • Built a complete data pipeline using yfinance, DuckDB, and Polars for high-performance financial data processing.

    • Developed a multi-page Streamlit dashboard with Daily Briefing, Unified Insight Hub, and Data Freshness monitoring.

    • Implemented performance instrumentation, silent failure detection across 62 code paths, and structured code quality standards.

    • Currently extending toward cloud-native deployment across multiple providers (Databricks, AWS, GCP, Azure) as a hands-on cloud learning vehicle.

  • Namjai Website (Full-stack Development)

    - Present

    Developed a comprehensive web application using Java Spring Boot, focusing on RESTful API design and database optimization.

    • Designed and implemented secure backend services and RESTful APIs.

    • Optimized SQL schemas to handle high-concurrency user traffic.

Skills

Skills

  • Data Infrastructure

    Hadoop

    Apache Spark

    PySpark

    HDFS

    Hive

    Apache Airflow

    DuckDB

    Polars

  • Engineering & DevOps

    Refactoring

    CI/CD

    GitHub Actions

    Docker

    Pytest

    Ruff

    Python Packaging (PEP 517/621)

  • Data & Analytics

    Streamlit

    SQL

    NLP

    OpenAI API

    Data Quality

    ETL Pipeline Design

  • Software Development

    Python (Expert)

    SQL

    C#/.NET Core

    Java Spring Boot

    React Native

Education

Education

  • Computer Engineering, Bachelor of Engineering, Chiang Mai University

    Jan, 2019 - Jan, 2023