Sergio Márquez

Python AI Developer - FastAPI, LLMs, RAG

I build LLM-powered backends in Python: FastAPI APIs, RAG pipelines, and pragmatic automations with n8n. Focused on shipping reliable, production-ready services.

Experience

AI/ML Developer

VITALY

May 2025 - Present

Backend engineer applying AI to real products and internal tooling.

  • Automated RAG ingestion via n8n + Python (Sheets -> MongoDB + Pinecone) with hash dedupe; removed manual uploads; initial loads of 500-600 docs; saved ~8-10 hours per setup.
  • Improved document validation pass rate from 70% to 90% using a GPT+Claude+Gemini ensemble, prompt/parse improvements, and OCR tuning (Tesseract, OpenCV).
  • Migrated chat to an agentic architecture (Google ADK) with LiteLLM; retrieval on MongoDB with custom chunking and reranking; ~15% lower median latency and ~35% lower infra cost.

Full-Stack Developer

VITALY

Apr 2021 - May 2025

Owned the Java/Spring backend and supported cloud/data integrations.

  • Middleware APIs with Spring connecting Angular frontends and third-party services.
  • Workloads on Kubernetes with automated GitLab CI/CD pipelines.
  • Post-merger data integration (Preving/Cualtis): analysis, mapping, and refactors.

Projects

One dAIly Blog

Live

Daily auto-generated blog powered by Gemini.

  • Angular
  • Node.js
  • Gemini
  • PostgreSQL
  • Tailwind

Certifications