Published onNovember 21, 2023Function Calling with Gorilla LLMServingFunction-CallingHost your own function calling LLM.
Published onNovember 4, 2023Fundamentals of Efficient Training on a Single GPUTrainingHere we show current out-of-the-box techniques for training on a single GPU.
Published onOctober 27, 2023Finetuning Mistral 7BFinetuningMistralHow to fine tune and serve your own Mistral 7B model.
Published onSeptember 14, 2023How Data Parallelism & Hardware Affect SpeedTrainingDDPFSDPData-ParallelismHow fast you can train your model depends on your hardware and your parallelism strategy. Knowing your hardware will guide you on which strategy you should use.
Published onAugust 14, 2023Drop-In Replacement for GPT with Llama 2 for OpenAI APILLMServingIntroductionLlama-IndexLangchainUse your favorite LLM application frameworks like Llama Index or Langchain with your self-served open source model deployment.