sam@latino:~$

sam@latino:~$ whoami

AI engineer. Agent systems, evals, and self-hosted LLM infrastructure — Rust + Python.

6 open-source projects  ·  524 tests  ·  Rust · Python · TypeScript

~/projects

6 dirs

~/writing

rss

· retrieval · bm25 · benchmarks

BM25 beat my vector database (sometimes)

A crossover framework for lexical versus vector retrieval on code — and the adversarial bench harness I built so my own argument can lose.

· tool calling · evals · vllm · gateways

Every model fails tool calling differently

Tool calling is the load-bearing primitive of every agent stack, and open models break it in at least eleven distinguishable ways. Naming the failure modes changes how you build the layer above.

· agent security · owasp · prompt injection · evals

Red-teaming my own agents with the OWASP Agentic Top 10

Turning "resists prompt injection" into a regression number: a deterministic harness, 146 probes across five OWASP agentic categories, and a hardening sweep that went 73% → 3% → 0%.

~/contact

email
latinosammy2@gmail.com
github
github.com/slatino-dev
linkedin
linkedin.com/in/samlatino
hf
huggingface.co/SamLatino