Wednesday, July 31, 2024
Show HN: FP32 matmul of large matrices up to 24% faster than cuBLAS on a 4090 https://ift.tt/DzrnkyB
I decided to share a CUDA kernel I wrote over 5 months ago. Nvidia's hardware and software may surprise you. https://ift.tt/VCNbqEz August 1, 2024 at 12:09AM