ML Engineering Articles: Search, RecSys, LLM Agents, MLOps

ML Engineering Articles: Search, RecSys, LLM Agents, MLOpsProduction ML engineering articles on Search, RecSys, LLM agents, and MLOps: architecture decisions, release patterns, evaluation methods, and reliability practices.https://igor-ya.com/en-USEvals for LLM Agents: The Minimal Production Sethttps://igor-ya.com/posts/llm-agent-evals-production-framework/https://igor-ya.com/posts/llm-agent-evals-production-framework/The minimal production eval set for LLM agents: outcome and trajectory evaluation, code and model graders, pass^k reliability, judge calibration, release gates.Tue, 09 Jun 2026 00:00:00 GMTLLMAgentsEvalsLLM-as-judgeAgentOpsObservabilityCI/CDRelease GatesSearch and Recommendation Logs as Data for LLM Post-Traininghttps://igor-ya.com/posts/product-logs-post-training-llm-search-recsys/https://igor-ya.com/posts/product-logs-post-training-llm-search-recsys/How to turn search and recommendation logs into datasets for LLM post-training: SFT, DPO, GRPO, hard negatives, LLM-as-judge, and release gates.Thu, 28 May 2026 00:00:00 GMTLLMPost-trainingSFTDPOGRPOLLM-as-judgeSearchRecSysMLOpsData InfrastructureMultimodal Retrieval for LLMshttps://igor-ya.com/posts/multimodal-retrieval-llm-context-selection/https://igor-ya.com/posts/multimodal-retrieval-llm-context-selection/How multimodal retrieval is used around LLMs: hybrid search, visual document retrieval, reranking, context packing, citations, long context, agentic search, and eval.Sat, 25 Apr 2026 00:00:00 GMTMultimodal RetrievalLLMSearchRAGVisual Document RetrievalRerankingContext EngineeringGPTClaudeGeminiAdding Tool Calling to Search Systems Without Breaking Retrieval, Reranking, or Controlhttps://igor-ya.com/posts/agentic-search-production-tool-calling-retrieval-reranking-control/https://igor-ya.com/posts/agentic-search-production-tool-calling-retrieval-reranking-control/A production guide to placing tool calls before retrieval, after reranking, or after answer selection without losing relevance, latency, safety, or rollback control.Sat, 21 Mar 2026 00:00:00 GMTSearchRetrievalRerankingTool CallingAgentsMCPObservabilityAI SecurityAssistants API to Responses API Migration: Production Playbook Before August 26, 2026https://igor-ya.com/posts/assistants-api-to-responses-api-migration-playbook-2026/https://igor-ya.com/posts/assistants-api-to-responses-api-migration-playbook-2026/A production migration guide from Assistants API to Responses and Conversations covering timeline, entity mapping, breaking changes, rollout strategy, and parity testing.Tue, 03 Mar 2026 00:00:00 GMTAssistants APIResponses APIConversations APIOpenAIMigrationAgentOpsMLOpsLLMOpsAI EngineeringOffline-Online Gap in RecSys: 11 Release Gates and Incident Playbookhttps://igor-ya.com/posts/deep-learning-recsys-offline-online-gap-production/https://igor-ya.com/posts/deep-learning-recsys-offline-online-gap-production/A production guide to RecSys offline-online failures: feedback loops, delayed labels, train-serve skew, OPE limits, release gates, and incident response playbooks.Thu, 26 Feb 2026 00:00:00 GMTDeep LearningRecSysMLOpsFeedback LoopsDelayed FeedbackFeature SkewCounterfactual EvaluationObservabilityAgent or Workflow: How to Choose Architecture Without Hypehttps://igor-ya.com/posts/agent-vs-workflow-architecture-framework/https://igor-ya.com/posts/agent-vs-workflow-architecture-framework/A practical framework for deciding between workflow automation and agent architecture, including safety boundaries, eval design, cost trade-offs, and rollout guidance.Wed, 18 Feb 2026 00:00:00 GMTLLMAgentsWorkflowSystem DesignAgentOpsEvalsAI SecurityFinOpsMLOps for a Support RAG Agent in 2026: Releases, Security, and Costhttps://igor-ya.com/posts/mlops-rag-agent-support-release-gates-security-cost-2026/https://igor-ya.com/posts/mlops-rag-agent-support-release-gates-security-cost-2026/A production guide to shipping a support RAG agent with release gates, policy boundaries, tracing, evaluation loops, and cost control.Tue, 10 Feb 2026 00:00:00 GMTMLOpsRAGAgentOpsLLMOpsAI SecurityObservabilityFinOpsMLOps for Production ML: 7 Release Gates for Controlled Rolloutshttps://igor-ya.com/posts/mlops-release-gates-production-ml/https://igor-ya.com/posts/mlops-release-gates-production-ml/A production MLOps guide to the seven release gates that keep model rollouts inside quality, latency, reliability, and cost limits.Fri, 26 Dec 2025 00:00:00 GMTMLOpsModel RegistryCI/CDObservabilityDrift DetectionFinOpsSREAI SecurityigorOS: A Browser-Based Agent Interface You Can Actually Usehttps://igor-ya.com/posts/igoros-alternative-site/https://igor-ya.com/posts/igoros-alternative-site/A browser-based agent interface demo that shows how tool calling, app state, and visible execution loops work inside a desktop-style environment.Mon, 15 Dec 2025 00:00:00 GMTWeb OSTool CallingAgent UXReactTypeScriptUITraining a Hybrid LLM and Recommender System with Semantic IDshttps://igor-ya.com/posts/semantic-ids-llm-recsys/https://igor-ya.com/posts/semantic-ids-llm-recsys/How to train a hybrid LLM and recommender system with semantic IDs, retrieval-aware objectives, and controllable recommendation outputs.Mon, 20 Jan 2025 00:00:00 GMTLLMRecommendationsSemantic IDsRQ-VAEQwen3RetrievalRankingSASRec