An engineering guide to turning product logs from search, recommendations, and multimodal retrieval into data for SFT, DPO, GRPO, reward models, and LLM-as-judge pipelines.
Search Engineering
Production search engineering guides: ranking and retrieval quality, latency budgets, relevance evaluation, and operational patterns for stable search systems.
An engineering article on the context selection layer around GPT, Claude, and Gemini: how to search, rank, and package PDFs, tables, screenshots, and visual evidence for grounded LLM answers.
A field guide for search teams adding tool calling as a bounded capability layer without losing relevance, latency discipline, safety, or rollback control.
How to teach a language model to understand a catalog through semantic IDs and produce controllable recommendations with explanations