Back to whynotsleep.cc

Large-model / multimodal algorithm engineer

Winston

Winston is a large-model and multimodal algorithm engineer focused on post-training for LLM/MM systems, agentic reinforcement learning, and generative search advertising and recommendation. This site is the public index for technical writing, games, project notes, design studies, manuscripts, and slower personal records.

Current focus

Large-model systems, multimodal post-training, and agentic optimization.

LLM/MM post-training

Preference optimization, instruction following, multimodal alignment, evaluation loops, and data recipes that make model behavior measurable.

Agentic RL

Training and evaluation patterns for agents that plan, use tools, recover from errors, and improve through interaction instead of static prompting alone.

Generative search ads and recommendations

Retrieval, ranking, generation, auction-aware objectives, user intent modeling, and feedback systems for search and recommendation surfaces.

Operating style

Artifacts, evidence, and stable routes.

  1. Treat claims as artifacts: show constraints, evidence, failure modes, and trade-offs.
  2. Keep public routes stable so projects, notes, and games can mature without link churn.
  3. Design interfaces with calm density: scannable, direct, and useful under repeated visits.

Contact

Open a new route.