Wednesday, July 31, 2024
Show HN: FP32 matmul of large matrices up to 24% faster than cuBLAS on a 4090 https://ift.tt/DzrnkyB
I decided to share a CUDA kernel I wrote over 5 months ago. Nvidia's hardware and software may surprise you. https://ift.tt/VCNbqEz
August 1, 2024 at 12:09AM