Thursday, November 30, 2023
Show HN: Collider – the platform for local LLM debug and inference at warp speed https://ift.tt/6O3wnfA
Show HN: Collider – the platform for local LLM debug and inference at warp speed ChatGPT turns one today :) What a day to launch the project I'm tinkering with for more than half a year. Welcome new LLM platform suited both for individual research and scaling AI services in production. GitHub: https://ift.tt/lyRU3xO Some superpowers: - Built with performance and scaling in mind thanks Golang and C++ - No more problems with Python dependencies and broken compatibility - Most of modern CPUs are supported: any Intel/AMD x64 platofrms, server and Mac ARM64 - GPUs supported as well: Nvidia CUDA, Apple Metal, OpenCL cards - Split really big models between a number of GPU (warp LLaMA 70B with 2x RTX 3090) - Not bad performance on shy CPU machines, fast as hell inference on monsters with beefy GPUs - Both regular FP16/FP32 models and their quantised versions are supported - 4-bit really rocks! - Popular LLM architectures already there: LLaMA, Starcoder, Baichuan, Mistral, etc... - Special bonus: proprietary Janus Sampling for code generation and non English languages https://ift.tt/lyRU3xO December 1, 2023 at 02:02AM
Subscribe to:
Post Comments (Atom)
Show HN: Rewriting Scratch 3.0 from scratch in Lua (browser-free native runtime) https://ift.tt/5Af6h7W
Show HN: Rewriting Scratch 3.0 from scratch in Lua (browser-free native runtime) Built a native Scratch 3.0 runtime in Lua that runs .sb3 pr...
-
Show HN: An AI logo generator that can also generate SVG logos Hey everyone, I've spent the past 2 weeks building an AI logo generator, ...
-
Show HN: I Made an AI Social Media Manager to Automate Content Creation Hey HN, I am a Solopreneur, and I love building apps to automate bor...
-
RoboPianist, a piano playing robot simulation in the browser https://ift.tt/zywcBo6 March 30, 2023 at 10:52PM
No comments:
Post a Comment