Using Local LLM with .NET
06/25/2024
In this post I explore how one can run a LLM locally to create a chat assistant with .NET 8. I use two libraries:
There is an excellent blog post on Demystifying Retrieval Augmented Generation with .NET by Stephen Toub describing how to use the SemanticKernel library to create a chat agent.
LLamaSharp is a library that can run local LLaMA/GPT model easily and fast in C#. It uses llama.cpp under the hood. I use version 10.0.0 along with semantic kernel version 1.3.0.