Projects

Here are my production ML projects: from generative AI and recommendations to MLOps infrastructures and observability. In each case - problem, architecture, and real result.

Voice AI Operator for Call Center

On-prem voice AI operator handles 72% of calls without human in 0.96s with 58% cost reduction.

Client: NDA Domain: FinTech

Problem: 600 seats in contact center, 9 min wait, SLA penalties and new AI Act requirements, regulations outdated faster than operators can learn.

Solution: On-prem stack with streaming, model cascade, orchestration, and knowledge base. Safety rules and manual escalation.

ML Inference Latency and Cost Evaluation Platform

Internal tool for profiling latency, throughput, and $/req of models in production

Client: NDA Domain: E-commerce

Problem: No unified monitoring standard: teams deployed models randomly, GPUs idle, latency fluctuated, costs not tracked.

Solution: Built platform with Prometheus, Kubecost, and Torch/ONNX profiling - now visible latency, throughput, load, and $/req at model level.

Telegram Antifraud Analytics for Media Plans

Fraud detection system reduces inefficient spending by 24% and automates verification of 100 channels in 12 minutes

Client: NDA Domain: AdTech

Problem: 30% of ad budget lost on channels with fraud, manual verification of 100 channels takes 25 hours

Solution: Hybrid detection system (rule-based + anomaly) with batch processing and adaptive thresholds by topics