eifachposte@lemmy.durstig.online [M] [B] to AI (Reddit RSS)@lemmy.durstig.online · English · 17 days ago

Better benchmarks make models better. I'm excited.

deepswe.datacurve.ai

  • cross-posted to:
  • singularity@lemmit.online
  • hackernews@lemmy.bestiver.se
DeepSWE
deepswe.datacurve.ai
DeepSWE measures frontier coding agents on original, long-horizon software engineering tasks.

Original Reddit post

Originally posted by u/NoFaithlessness951 on r/ClaudeCode

