Agent Reinforcement Trainer: train multi-step agents for real-world tasks using GRPO. Give your agents on-the-job training. Reinforcement learning for Qwen2.5, Qwen3, Llama, and more!
Repositories
rlancemartin's repositories
An experimental open-source attempt to make GPT-4 fully autonomous.
A Python program that turns an LLM running on Ollama into an automated researcher: from a single query it determines focus areas to investigate, runs web searches, scrapes content from relevant websites, and carries out the research on its own. It can also save the findings for you, among other features.
Biomni: a general-purpose biomedical AI agent
Simple retrieval from LLMs at various context lengths to measure accuracy
🐚 OpenDevin: Code Less, Make More
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
A collection of projects designed to help developers quickly get started with building deployable applications using the Anthropic API
Evaluation tool for LLM QA chains
An open source ChatGPT UI.
A simple memory system for Claude Code
Continual Learning Bench
DeepResearch Bench: A Comprehensive Benchmark for Deep Research Agents
Devika is an Agentic AI Software Engineer that can understand high-level human instructions, break them down into steps, research relevant information, and write code to achieve the given objective. Devika aims to be a competitive open-source alternative to Devin by Cognition AI.