Alex Kargin

Data Engineer

About Me

I'm Alex Kargin, a data and ML/AI engineer based in Boca Raton, Florida. I build production data pipelines and applied AI systems — the kind that actually get used, not the kind that win a hackathon and die in a prototype folder.

I founded kargin-utkin.com, a data engineering publication with hands-on writeups on PostgreSQL, ClickHouse, dbt, Airflow, PySpark and Kafka — plus a 14-course Learn platform covering SQL fundamentals through distributed systems internals. I also founded InnovaTek Solutions, a consultancy specializing in private AI installations and legacy-system modernization for regulated industries.

I'm currently focused on Snowflake data warehousing, Dagster orchestration, end-to-end ELT pipelines, and self-hosted LLM deployments for air-gapped environments.

Skills

  • Data Warehousing: Snowflake, dbt, Star Schema, SCD, RBAC
  • Python: FastAPI, Pandas, scikit-learn, spaCy NLP
  • Databases: PostgreSQL, ClickHouse, MySQL, Redis
  • Orchestration: Dagster, Airflow, DAGs, Scheduling
  • Cloud & DevOps: Docker, Azure Data Factory, CI/CD
  • AI/ML: Ollama/LLM, Groq, Embeddings, NER
  • Architecture: Microservices, REST APIs, ETL/ELT
  • Frontend: Next.js, React, TypeScript

Portfolio

Production systems handling real-time data, ML pipelines, and cloud infrastructure.

LegacyToCloud

LegacyToCloud.com

Database migration platform that analyzes legacy MySQL/PostgreSQL/MSSQL schemas and generates cloud-ready DDL for PostgreSQL and Snowflake. Includes a real-time financial data pipeline (Alpha Vantage → PostgreSQL → ClickHouse) with interactive dashboards.
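To give a feel for the schema-translation idea, here is a minimal Python sketch that maps a few MySQL column types to PostgreSQL and emits a CREATE TABLE statement. It is an illustration only, not the code behind LegacyToCloud: the function names `map_type` and `create_table`, the sample table, and the small set of type mappings are assumptions made for this example, and a real migration would also need to handle keys, defaults, unsigned types, and much more.

```python
# Illustrative sketch only: a tiny, hypothetical MySQL -> PostgreSQL type mapper.
# The mapping tables below are assumptions and cover just a handful of types.

# Types that must match exactly, including their length argument.
EXACT_TYPES = {"tinyint(1)": "boolean"}

# Types matched on the base name; MySQL display widths such as int(11) are dropped.
BASE_TYPES = {
    "tinyint": "smallint",
    "smallint": "smallint",
    "int": "integer",
    "bigint": "bigint",
    "datetime": "timestamp",
    "double": "double precision",
    "text": "text",
}

# Types whose spelling, including arguments, is valid in both dialects.
PASS_THROUGH = {"varchar", "char", "decimal", "numeric"}


def map_type(mysql_type: str) -> str:
    """Return a PostgreSQL type for a MySQL column type, or raise ValueError."""
    normalized = " ".join(mysql_type.lower().split())
    if normalized in EXACT_TYPES:
        return EXACT_TYPES[normalized]
    base = normalized.partition("(")[0]
    if base in PASS_THROUGH:
        return normalized
    if base in BASE_TYPES:
        return BASE_TYPES[base]
    raise ValueError(f"no mapping defined for MySQL type: {mysql_type!r}")


def create_table(table: str, columns: list[tuple[str, str]]) -> str:
    """Build a PostgreSQL CREATE TABLE statement from (name, mysql_type) pairs."""
    lines = [f"    {name} {map_type(col_type)}" for name, col_type in columns]
    return f"CREATE TABLE {table} (\n" + ",\n".join(lines) + "\n);"


if __name__ == "__main__":
    print(create_table("users", [
        ("id", "int(11)"),
        ("name", "varchar(255)"),
        ("active", "tinyint(1)"),
    ]))
```

Raising an error on unknown types, rather than silently falling back to a generic type, keeps unmapped columns visible for manual review.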

CelebrityMention

CelebrityMention.com

Real-time news monitoring system that ingests 100+ articles daily from 30+ RSS feeds, applies spaCy NER with fuzzy name matching to identify mentions against a database of 132K celebrities, and scores content on a 1–10 severity scale.
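As an illustration of the fuzzy name matching step, here is a minimal Python sketch built on the standard library's difflib. It is not the production code behind CelebrityMention: the function names, the sample names, and the 0.85 similarity threshold are assumptions chosen for the example, and a linear scan like this would likely need an index or blocking step to work against a database of 132K names.

```python
# Illustrative sketch only: fuzzy-match an extracted name against known names.
# The threshold and helper names are hypothetical, not taken from the project.
from difflib import SequenceMatcher
from typing import Optional


def normalize(name: str) -> str:
    """Lowercase the name and collapse runs of whitespace."""
    return " ".join(name.lower().split())


def best_match(
    mention: str,
    known_names: list[str],
    threshold: float = 0.85,  # assumed cutoff; tune against real data
) -> Optional[tuple[str, float]]:
    """Return (name, similarity) for the closest known name, or None if too weak."""
    target = normalize(mention)
    best_name, best_score = None, 0.0
    for name in known_names:
        score = SequenceMatcher(None, target, normalize(name)).ratio()
        if score > best_score:
            best_name, best_score = name, score
    if best_name is not None and best_score >= threshold:
        return best_name, best_score
    return None


if __name__ == "__main__":
    names = ["Taylor Swift", "Tom Hanks"]
    print(best_match("Taylor Swfit", names))       # misspelled mention
    print(best_match("Completely Unknown", names))  # no close candidate
```

In a pipeline like the one described, the `mention` strings would come from the PERSON entities that an NER model extracts from each article.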

AllMyQuotes

AllMyQuotes.com

Quote discovery platform with 3M+ quotes and 100K+ authors. Features ML classification pipelines, local LLM inference (Ollama), sentiment analysis, and a multi-server architecture with dedicated GPU processing for ETL workloads.

Laurela

Laurela.com

Prediction market analytics platform with a dual-server architecture. A local GPU analytics engine, orchestrated by Dagster, pushes processed data to the production server over an SSH tunnel, delivering real-time market insights with LLM-powered analysis.

Boss.cc

Boss.cc

Crypto trading desk simulator with real-time market data ingestion from CoinGecko API, automated AI-driven daily market analysis, decision tracking, and scheduled data pulls with historical analytics.

Kargin-Utkin

Kargin-Utkin.com

Data & ML engineering hub with technical guides on ETL pipelines, vector databases, GPU analytics, and lakehouse architecture. Features curated AI/data industry news, tool comparisons, and an ecosystem directory of 23+ categories across the modern data stack.

FreeDiva

FreeDiva.com

AI-powered personal styling platform that analyzes body type, skin tone, and proportions from uploaded photos to generate tailored outfit recommendations. Features a chat stylist interface, body shape classification, and shoppable outfit suggestions.
