Posts sorted in themes

The codes developed for posts were only for those with expressed intent. Posts that do not make the OP’s intention of posting clear were not considered.

Addressing a shared and in-demand problem of the community (spc)

Note: adding properties post_id, github, huggingface, notebook, and blogpost to all of these.

Posts

Desiring feedback from the community (ffc)

Posts

Peer education pe

Has a lot of overlapping codes with ffc and spc.

Asking the community to share their projects spi

Desiring post to inspire others (pio)

Giving back to the community (gbc)

Posts sorted in project types

Table

Filepost_idgithubhuggingfaceblogpostnotebook
A common misconception about RAG frameworkst3_1dp9fgu----
A paradigm shift in machine translationt3_16p2smj----
AutoRAGt3_1aulov2https://github.com/Marker-Inc-Korea/AutoRAG---
Axiom prompt engineeringt3_1hhkeh3https://github.com/codedidit/axiomprompting---
Background erase networkt3_1gpzqkj---
Benchmarking effects of quantizationt3_1cdxjaxhttps://github.com/jd-3d/MPA_Bench/tree/main---
Click3-Android agentt3_1hgu5qihttps://github.com/BandarLabs/clickclickclick---
Collection of open source RAG techniquest3_1eqec8v---
Comparing different whisper packagest3_1brqwun--https://amgadhasan.substack.com/p/sota-asr-tooling-long-form-transcription-
Dangers of malicious GGUF filest3_1bwjxaj----
Do you guys finetune models?t3_1evhqin----
EntityDBt3_1hryy21https://github.com/babycommando/entity-db---
Experience with Codestral for Android developmentt3_1ds9ogn----
FastApply-open source Cursort3_1ds9ognhttps://github.com/kortix-ai/fast-apply--
GGUF development for llama.cppt3_15triq2https://github.com/ggerganov/llama.cpp/pull/2398#issuecomment-1682404719---
GGUF security advisoryt3_1bist4o----
GPT-4 alternativest3_18he4lg----
How many people are fine tuning?t3_1f5fgn3----
How we chunkt3_1dpb9ow----
Huge LLM Comparisont3_17kpyd2----
LLM fine-tuning datasetst3_1cg2ce7---
LLM4SQLt3_1btz6x4https://github.com/gd03champ/llm4sql---
Lexi Llama-3-8B-Uncensoredt3_1cbhqzk-https://huggingface.co/Orenguteng/Lexi-Llama-3-8B-Uncensored--
Llama 2 prompt formatt3_155po2p--https://huggingface.co/blog/llama2#how-to-prompt-llama-2-
Llama 3 GGUF conversion bugt3_1ckvx9l----
Llama 3 quants comparisont3_1cst400https://github.com/matt-c1/llama-3-quant-comparison---
LLM “serious use” comparisont3_172ai2j----
Local translator based on LLaMAt3_145s65p---
Misguided attentiont3_1cwa3jl---
Mistral-Large 35GB quantizationt3_1elbn3q-https://huggingface.co/ChenMnZ/Mistral-Large-Instruct-2407-EfficientQAT-w2g64-GPTQ--
NexaAI—Ollama alternativet3_1fc3yjthttps://github.com/NexaAI/nexa-sdk---
Omnivision-968Mt3_1grkq4j-https://nexa.ai/blogs/omni-vision-
Open LLM Leaderboard new interfacet3_1hbb85n---
Open source coding assistants?t3_1as9pi4----
Open-source Perplexity alternativet3_1cuxah4---
Optimize Whisper for fast inferencet3_1d1xzpi----
OpusV1 models for story-writing and role-playingt3_1b2apia---
Parler TTS v1t3_1encx98--
Perfect labels via prompting 2t3_1actbr1----
Perfect labels via promptingt3_1amvfua----
PocketPalt3_1fppt99----
Prompt format comparisonst3_18ljvxb----
Quantizing Llama 3 8B seems more harmfult3_1cci5w6----
Qwen2-Audiot3_1gzq2erhttps://github.com/NexaAI/nexa-sdkhttps://huggingface.co/NexaAIDev/Qwen2-Audio-7B-GGUF--
Qwen2.5 14B quant comparisont3_1flqwzw----
Qwen2.5 32B quant comparisont3_1fkm5vd----
RAGBuildert3_1f04ib2https://github.com/kruxai/ragbuilder---
SmallThinker-3B-Previewt3_1hpop3y---
Translate to 400+ languagest3_17qt6m4---
Trailing whitespace in promptst3_17dyc8a----
Unsloth dynamic quantizationt3_1h6ojwr-https://huggingface.co/unsloth/QwQ-32B-Preview-unsloth-bnb-4bithttps://unsloth.ai/blog/dynamic-4bithttps://colab.research.google.com/drive/1T5-zKWM_5OD21QHwXHiV9ixTRR7k3iB9?usp=sharing
UGI-Leaderboard remaket3_1i0ou0v---
Unslotht3_188197jhttps://github.com/unslothai/unsloth---
Vast.ai cloud inferencing guidet3_13zknvj----
WizardVicunaLMt3_1376ohohttps://github.com/melodysdreamj/WizardVicunaLM--
Yet another RAG systemt3_16cbimihttps://github.com/snexus/llm-search/tree/main---
Your best RAG projectst3_1e5n96c----
exl2 quantizationt3_18eyf39https://github.com/turboderp-org/exllamav2https://huggingface.co/LoneStriker/Aetheria-L2-70B-2.4bpw-h6-exl2-2--
llama.cpp —logit-bias flagt3_13j3747----
llama.cpp breaking changet3_13md90j---
voicechat2t3_1eju211https://github.com/lhl/voicechat2---
Success with a local voice chatt3_13snjvx---
AgentSearcht3_18ntozg--
llama.cpp web search integrationt3_1fhaqjg---