Reviving Papers with Code

(paperswithcode.co)

96 points | by nielz_r 2 days ago

6 comments

  • nielz_r 2 days ago
    Hi,

    Niels here from the open-source team at Hugging Face. Like many others, I was a huge fan of paperswithcode, a website which allowed to easily find the state-of-the-art (SOTA) across any domain of AI, from computer vision to language models to time-series forecasting. Sadly, that website is no longer maintained after its acquisition by Meta.

    Hence, I've been working on reviving it. I obviously use AI agents to parse papers at scale and automatically generate leaderboards (for now I'm the one verifying results). So far, I've only parsed high-impact papers for which I know they're SOTA, like Qwen 3.5 and 3.6, RF-DETR for object detection, DINOv3, SOTA embedding models from the MTEB leaderboard, the Open ASR Leaderboard for automatic speech recognition models, etc.

    For now, it includes the following:

    > trending papers by default based on Github star velocity

    > categorization by domain, e.g., [OCR](https://paperswithcode.co/tasks/ocr)

    > methods, popular techniques used across AI papers, which PwC used to have as well, like [RLVR](https://paperswithcode.co/methods/rlvr) and

    > eval results for high-impact papers, see e.g., Qwen 3.5 at the bottom

    > leaderboards for each domain, e.g., MMTEB or COCO val 2017

    > conferences, like [CVPR 2026](https://paperswithcode.co/conferences/cvpr-2026)

    > support for citation counts (you can also see the most cited papers by domain!)

    > automated linked Github, project page URLs, and artifacts (+ multiple repos are supported on a paper page)

    > support for external papers beyond Arxiv, see e.g., [DeepSeek v4](https://paperswithcode.co/paper/82956)

    > Harness reports for coding agent benchmarks, e.g., Terminal Bench

    > "Sign in with HF" and Storage Buckets are used to store humbnails, paper PDFs, and overall data backups.

    I'm curious about your feedback + feature requests!

    Try it at https://paperswithcode.co

    • cyril_st_john 40 minutes ago
      Any interest in expanding it beyond just AI papers? "Papers with Code" sounds like it could be much more broad than it currently is. I was excited to browse the "All Domains" section until I realized only AI topics are covered - just because so many of the papers that are relevant to my work would not fall under any of these categories.
    • vjsrinivas 1 hour ago
      What will happen to Huggingface's Trending Paper page? Its been my alternative since PWC closed, but they seem to have a level of overlap.
    • somethingsome 1 hour ago
      Hi! Thanks for the effort!

      It would be lovely to parse which datasets/benchmarks were used in the comparisons and select papers by dataset!

      In many fields the datasets vary greatly depending on the subfield and its very difficult to find what other benchmarks could be used.

    • 2ap 2 hours ago
      This is great. To test it out I just submitted one of my papers on medRXiv and it was super straightforward to do.
    • Ajoha 3 hours ago
      Hi, this is really interesting and I’ll pin that URL. :) Is there something similar for papers regarding psychology, neuroscience and tech?
      • barrenko 1 hour ago
        Tho I am not sure, try checking out the Huggingface's dedicated science discord.
    • caldarons 3 hours ago
      This is great work, thank you!

      One feature I would love is to get notified via email when new papers are added (or periodically, once a week/daily).

    • adithyaharish 2 hours ago
      This is great work, keep it going
      • adithyaharish 2 hours ago
        I have pinned the url and added it to be tab group bookmark
    • wanderlust123 4 hours ago
      Thank you, I think this is a noble effort. Knowledge is being created at a rapid rate and being able to surface interesting stuff is valuable.
  • Sharlin 55 minutes ago
    Shame about the name, it feels better suited to a more general curated repo/hall-of-fame of papers in any field that come with easily rerunnable code to reproduce the paper’s results, or try out different datasets, or similar.
  • quibono 1 hour ago
    This is a bit off-topic (though tangentially related) - does anyone remember a similar blog where the author would do something like a "5 minute paper" review, i.e. they'd discuss findings and try to communicate the main point? It was usually a paper per week, mostly CompSci / maths papers IIRC
    • henrythewasp 1 hour ago
      Was it "The morning paper" by Adrian Colyer? - https://blog.acolyer.org/
      • ndr 1 hour ago
        Thank you for resurfacing this, it has been my daily commute read for years, it was great!
      • quibono 24 minutes ago
        Yes, exactly this! Thank you!
  • steinvakt2 2 hours ago
    Yes please! I have been frustrated with the state of object detection models especially. Everyone claims SOTA. So you end up having to test manually to find out which one actually is. And unlike LLM's, it should be pretty easily quantifiable.
  • jamoio 2 hours ago
    Is there an RSS feed?
  • kozzion 3 hours ago
    Bring it back! Sing it back!