HuggingFace pipeline returns empty string with Gemini 1.5 Pro
Last week a quick text‑generation pipeline came back blank every call.
... ➦vLLM tokenizer mismatch on finetuned Mistral model
Spent the afternoon benchmarking a Mistral‑7B finetune with vLLM. First prompt returned gibberish tokens.
... ➦OpenRouter API response missing ’text’ in JSON payload
Testing OpenRouter with gpt‑4 turbo. Request looked fine but the completion field came back empty:
... ➦langchain.tools.SerpAPIWrapper throws Invalid API Key despite valid key
Spotted this today while wiring a quick search tool.
... ➦Gemini Pro API throws 403 in Postman with valid key
Tried hitting the Gemini Pro v1/models/gemini-pro:generateContent endpoint in Postman.
Mistral inference on local GPU hits OOM with 13B model
Last week I tried the 13B Mistral model on a single RTX 3060 (12 GB). python main.py crashed instantly:
Ollama install on Windows missing symlink permissions
Setting up Ollama on Windows 11 this morning. Installer finished, but every ollama run threw:
langchain.chains.ConversationalRetrievalChain() fails with FAISS store
Tried to wire ConversationalRetrievalChain with a FAISS vector store. The constructor crashed.
... ➦Gradio app stalls on LLaMA 2 inference with 8‑bit quant
Spun up a quick Gradio demo around LLaMA‑2‑7B‑chat‑hf using 8‑bit quantisation. Prompt box froze after I hit “Submit”; no tokens returned.
... ➦AutoGPT stuck repeating the same task loop
Last night I let AutoGPT try a simple research job. It found the first website, wrote a plan, then looped on the same “visit site” action forever.
... ➦