Wednesday, July 23, 2025

Show HN: Kafka, the first AI employee (NEW SOTA ON GAIA BY 20%) https://ift.tt/k2vgUEl

Show HN: Kafka, the first AI employee (NEW SOTA ON GAIA BY 20%) Hi HN, I'm Gokhan, the founder of Brainbase Labs. Today we're releasing an early preview of our first generalist agent, Kafka. Kafka is the first AI employee, he comes with his own computer as well as his own email, phone and Slack so you can work with him just like you would with a regular employee. You can forward him emails, give him a call, tag him on Slack. We built Kafka as the basis for our other AI employees we will be releasing over the coming months. Kafka currently achieves 77.2% on the GAIA Level 3 benchmark, getting us closer to human performance at 87%. We've achieved this by creating a new type of planning algorithm called "structured planning" which allows Kafka to run very long term plans without getting sidetracked or hallucinating. Kafka can do some cool things, he can push code to AWS, direct its own commercial using Veo3 and do actual production tasks on Upwork/Fiverr. We're very keen to hear what HN thinks about Kafka, and how we can improve. Appreciate any feedback! https://ift.tt/7oEhJ4w July 23, 2025 at 10:21PM

No comments:

Post a Comment

Show HN: TheProtector – Linux Bash script for the paranoid admin on a budget https://ift.tt/Ij9Uo26

Show HN: TheProtector – Linux Bash script for the paranoid admin on a budget Hi HN, I spent the past year building this in my spare time bec...