| | Inference at the edge: Efficient transformer model inference on-device (github.com/ggerganov) |
| 3 points by lioeters on July 2, 2023 | past |
|
| | WIP Llama.cpp Vulkan Implementations (github.com/ggerganov) |
| 24 points by brucethemoose2 on July 1, 2023 | past | 1 comment |
|
| | Falcon 40B Working on Ggml (github.com/ggerganov) |
| 5 points by __anon-2023__ on June 15, 2023 | past |
|
| | Llama.cpp: Full CUDA GPU Acceleration (github.com/ggerganov) |
| 728 points by gzer0 on June 13, 2023 | past | 310 comments |
|
| | K-Quants (github.com/ggerganov) |
| 3 points by tosh on June 10, 2023 | past |
|
| | Llama.cpp: Roadmap June 2023 (github.com/ggerganov) |
| 1 point by tosh on June 7, 2023 | past |
|
| | Full GPU Inference of LLaMA on Apple Silicon Using Metal (github.com/ggerganov) |
| 4 points by behnamoh on June 5, 2023 | past |
|
| | GGML: Replit and Mosaic MPT Support (github.com/ggerganov) |
| 1 point by tosh on May 12, 2023 | past |
|
| | Ggwave: Message people over sound (github.com/ggerganov) |
| 136 points by BasedDebianUsr on May 1, 2023 | past | 46 comments |
|
| | Whisper.cpp v1.4.0 (github.com/ggerganov) |
| 162 points by tosh on April 30, 2023 | past | 45 comments |
|
| | llama.cpp: Roadmap May 2023 (github.com/ggerganov) |
| 97 points by tosh on April 30, 2023 | past | 6 comments |
|
| | StableLM already being ported to ggml (github.com/ggerganov) |
| 3 points by theolivenbaum on April 20, 2023 | past |
|
| | Ggwave: Tiny Data-over-Sound Library (github.com/ggerganov) |
| 2 points by lachlan_gray on April 19, 2023 | past |
|
| | Talk.wasm: OpenAI's Whisper run in browser (github.com/ggerganov) |
| 2 points by behnamoh on April 18, 2023 | past |
|
| | Whisper.cpp now has CoreML suppprt (github.com/ggerganov) |
| 70 points by schappim on April 15, 2023 | past | 5 comments |
|
| | Llama.cpp: Add GPU support to ggml (github.com/ggerganov) |
| 3 points by mromanuk on April 12, 2023 | past |
|
| | [flagged] Revert for jart’s llama.cpp MMAP miracles (github.com/ggerganov) |
| 86 points by mmoustafa on April 2, 2023 | past | 86 comments |
|
| | [flagged] 30B model now needs only 5.8GB of RAM? How? (github.com/ggerganov) |
| 31 points by olalonde on April 1, 2023 | past | 11 comments |
|
| | Llama.cpp 30B runs with only 6GB of RAM now (github.com/ggerganov) |
| 1311 points by msoad on March 31, 2023 | past | 414 comments |
|
| | Keytap2 – acoustic keyboard eavesdropping based on language n-gram frequencies (github.com/ggerganov) |
| 2 points by tontonius on March 31, 2023 | past |
|
| | The Diff Challenge (github.com/ggerganov) |
| 29 points by another on March 22, 2023 | past | 3 comments |
|
| | Inference at the Edge (github.com/ggerganov) |
| 2 points by georgehill on March 17, 2023 | past |
|
| | Inference at the Edge (github.com/ggerganov) |
| 4 points by Mizza on March 16, 2023 | past |
|
| | Run a GPT-3 style AI on your local machine, fully on premise (github.com/ggerganov) |
| 2 points by luispa on March 14, 2023 | past |
|
| | LLaMA 7B model running on 4GB RAM Raspberry Pi 4 (github.com/ggerganov) |
| 6 points by amrrs on March 12, 2023 | past |
|
| | Llama.cpp: Port of Facebook's LLaMA model in C/C++, with Apple Silicon support (github.com/ggerganov) |
| 989 points by mrtksn on March 10, 2023 | past | 284 comments |
|
| | Talk = GPT-2 and Whisper and WASM (github.com/ggerganov) |
| 189 points by tomthe on Dec 7, 2022 | past | 50 comments |
|
| | Show HN: Port of OpenAI's Whisper model in C/C++ (github.com/ggerganov) |
| 399 points by ggerganov on Dec 6, 2022 | past | 87 comments |
|
| | Whisper: performant port of OpenAI's Whisper spech recognition model in C/C++ (github.com/ggerganov) |
| 4 points by nateb2022 on Nov 30, 2022 | past |
|
| | CPU Port of OpenAI's Whisper Speech to Text (github.com/ggerganov) |
| 41 points by abetusk on Nov 28, 2022 | past | 3 comments |
|
|
| More |