Skip to content
View freddiev4's full-sized avatar
🚢
ship it
🚢
ship it

Organizations

@github-beta @jupyterlab @Cohere-Labs-Community @Hugging-Face-Supporter @Hugging-Face-Helping-Hand @quotient-ai

Block or report freddiev4

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
freddiev4/README.md

✨ Hello! ✨

I build systems and tools for Machine Learning -- previously I worked at GitHub on GitHub Copilot building evals & infra for tab completion and Chat, and did open research with Cohere Labs on Aya.

Follow me on X for updates.

Past Research

Venue Paper Contributions
Nature 2026 Humanity's Last Exam Contributed a difficult math & statistics question about theoretical max damage in Old School Runescape
ACL 2024 (Best Paper Award) Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Data Engineering to create the Aya Human Annotated set of the Aya eval suite: dataset
ACL 2024 (1st Author) Aya Dataset: An Open-Access Collection for Multilingual Instruction Tuning Designed, built, and scaled out the backend for an open data annotation platform for over 3000 contributors and 300,000+ annotations for training multilingual instruction-tuned LLMs on mixed-resource languages: repo. Co-created the Aya Score to encourage participants to incorporate more edits during annotation, increasing average annotation length by >50%
CSCW 2017 Crowd Guilds: Worker-led Reputation and Feedback on Crowdsourcing Platforms Helped design a reputation system for workers and requesters to gain experience, receive higher wages and higher quality work

Current Projects

  • rune — a coding agent for data engineering, search, and analytics over my personal data
  • aqueduct — agent-owned DAG-based data pipelines to NAS & S3
  • yts3 — rust library for encoding arbitrary files into lossless video, using YouTube as S3 storage

Pinned Loading

  1. agents agents Public

    plugins, skills, scripts, etc for interacting with coding agents

    Shell 1

  2. pokeshadowbench pokeshadowbench Public

    "Who's that Pokemon?" evals for LLMs

    Python 2

  3. aqueduct aqueduct Public

    agent-owned DAG-based data pipelines to NAS & S3

    Python 3

  4. rune rune Public

    general purpose data agent for swe data tasks

    Python 5

  5. yts3 yts3 Public

    rust library for using YouTube as S3 storage

    Rust 2

  6. fvfs fvfs Public

    virtual filesystem with tiered storage across local disk, NAS, and S3

    Rust 1