i'm an AI/ML engineer based in the US. right now i'm building production AI systems at Reallytics.ai and Verticiti, mostly getting large language models to do useful things in the real world: not demos, but actual systems with real users and real traffic.

before this i spent a few years at Afiniti and Cloud Kinetics working on fraud detection, voice analytics, and enterprise search. the kind of stuff that pages you at 3am when something breaks.

honestly, what keeps me going is when an agent you built solves a problem you never explicitly told it to solve. that feeling never gets old.

what i'm working on right now:
Agentic AI Workflows
RAG Enterprise Search
Voice AI Platform
LLM Fine-Tuning LoRA
RLHF LLM Optimization
Sentinel Fraud Detection
not going to pretend i use everything equally. here's what i actually reach for:
the full picture:

| category | tools |
| --- | --- |
| daily drivers | Python, PyTorch, FastAPI, Docker, Git, VS Code |
| LLM and GenAI | LangChain, LlamaIndex, HuggingFace Transformers, vLLM, PEFT/LoRA/QLoRA |
| data and vector | FAISS, ChromaDB, Pinecone, PostgreSQL, MongoDB, Redis, Kafka, Elasticsearch |
| cloud and MLOps | AWS (SageMaker, Bedrock, Lambda, ECS), GCP Vertex AI, Azure OpenAI |
| ML frameworks | TensorFlow, scikit-learn, XGBoost, LightGBM, ONNX |
| infrastructure | Kubernetes, Terraform, GitHub Actions, MLflow, Weights & Biases |
i write about what i'm building and learning. nothing polished, more like notes to my future self that happen to be public.
Real World Applications of Reinforcement Learning
Real Time Data Streams for ML Model Training
💬 Commented on [Bug]: MiniMax-M3-MXFP8 hangs on FlashInfer MNNVL all-reduce in vllm-project/vllm (2026-06-16)
💬 Commented on ci: streamline GitHub Actions test coverage and runtime in feast-dev/feast (2026-06-16)
💬 Commented on [Bug]: Artifact integrity report follows security policy in wandb/wandb (2026-06-16)
💬 Commented on bug: async delete_all race condition corrupts entity store l in mem0ai/mem0 (2026-06-16)
💬 Commented on Community Case Study: Structured constraints improve code ge in zai-org/ChatGLM-6B (2026-06-16)
💬 Commented on [provider-mapping-sweep] cohere: prompt-cache hit count (usa in pydantic/pydantic-ai (2026-06-16)
💬 Commented on API-key (Bearer) auth still depends on Keycloak offline sess in OpenHands/OpenHands (2026-06-16)
💬 Commented on First-party Agent Skills support in langchain (without depen in langchain-ai/langchain (2026-06-16)
stuff i've been digging into recently. mostly papers, blog posts, and rabbit holes that kept me up too late.
🔬 AI Model Monitoring, Evaluation, and Automated Feedback Loops
🔬 Retrieval-Augmented Generation (RAG) in Production Search and Chat Applications
🔬 Large Language Model (LLM) Fine-Tuning with Parameter-Efficient Techniques
🔬 Causal Inference and Discovery in Observational Data
🔬 Real-World Applications of Reinforcement Learning from Human Feedback
🔬 Real-Time Data Streams for ML Model Training
📌 Streaming JSON Parser for Large Language Models — Production Pattern (Python) (2026-06-16)
📌 Multi-Provider LLM Router with Fallback — Production Pattern (Python) (2026-06-15)
🤖 Profile auto-updated on 2026-06-16 17:15 UTC


