Peace Be Upon You Questers 😇
This is Yasir Hamid Ansari, today sharing something very new,
Meet Devin, the world’s first fully autonomous AI software engineer.
Devin is a tireless, skilled teammate, equally ready to build alongside you or independently complete tasks for you to review.
With Devin, engineers can focus on more interesting problems and engineering teams can strive for more ambitious goals.
With advances in long-term reasoning and planning, Devin can plan and execute complex engineering tasks requiring thousands of decisions. Devin can recall relevant context at every step, learn over time, and fix mistakes.
Devin is also equipped with common developer tools including the shell, code editor, and browser within a sandboxed compute environment—everything a human would need to do their work.
Finally, Devin is having the ability to actively collaborate with the user. Devin reports on its progress in real time, accepts feedback, and works together with you through design choices as needed.
Devin can learn how to use unfamiliar technologies.
After reading a blog post, Devin runs ControlNet on Modal to produce images with concealed messages.
Devin can build and deploy apps end to end.
Devin makes an interactive website which simulates the Game of Life! It incrementally adds features requested by the user and then deploys the app to Netlify.
Devin can autonomously find and fix bugs in codebases.
Devin helps us maintain and debug his open source competitive programming book.
Devin can train and fine tune its own AI models.
Devin sets up fine-tuning for a large language model given only a link to a research repository on GitHub.
Devin can address bugs and feature requests in open source repositories.
Given just a link to a GitHub issue, Devin does all the setup and context gathering that is needed.
Devin can contribute to mature production repositories.
This example is part of the SWE-bench benchmark. Devin solves a bug with logarithm calculations in the sympy Python algebra system. Devin sets up the code environment, reproduces the bug, and codes and tests the fix on its own.
Devin is even tried for real jobs on Upwork, and it could do those too!
Here, Devin writes and debugs code to run a computer vision model. Devin samples the resulting data and compiles a report at the end.
Devs evaluated Devin on SWE-bench, a challenging benchmark that asks agents to resolve real-world GitHub issues found in open source projects like Django and scikit-learn.
Devin correctly resolves 13.86%* of the issues end-to-end, far exceeding the previous state-of-the-art of 1.96%. Even when given the exact files to edit, the best previous models can only resolve 4.80% of issues.
All the above Content is taking from here .
Hope You Like It!
Follow @YasirTheJOD for more such Qool Updates in the iQOO Community
See You Again!
Happy Questing!
Please sign in
Login and share