Projects

Here are my production ML projects: from generative AI and recommendations to MLOps infrastructures and observability. In each case - problem, architecture, and real result.

Voice AI Operator for Call Center

Voice AI Operator for Call Center

On-prem voice AI operator handles 72% of calls without human in 0.96s with 58% cost reduction.

Client: NDA Domain: FinTech

Problem: 600 seats in contact center, 9 min wait, SLA penalties and new AI Act requirements, regulations outdated faster than operators can learn.

Solution: On-prem stack with streaming, model cascade, orchestration, and knowledge base. Safety rules and manual escalation.

ML Inference Latency and Cost Evaluation Platform

ML Inference Latency and Cost Evaluation Platform

Internal tool for profiling latency, throughput, and $/req of models in production

Client: NDA Domain: E-commerce

Problem: No unified monitoring standard: teams deployed models randomly, GPUs idle, latency fluctuated, costs not tracked.

Solution: Built platform with Prometheus, Kubecost, and Torch/ONNX profiling - now visible latency, throughput, load, and $/req at model level.

Telegram Antifraud Analytics for Media Plans

Telegram Antifraud Analytics for Media Plans

Fraud detection system reduces inefficient spending by 24% and automates verification of 100 channels in 12 minutes

Client: NDA Domain: AdTech

Problem: 30% of ad budget lost on channels with fraud, manual verification of 100 channels takes 25 hours

Solution: Hybrid detection system (rule-based + anomaly) with batch processing and adaptive thresholds by topics