News

LLM inference in C/C++, further modified for Rubra function calling models - rubra-ai/tools.cpp ...
Below is a list of hosted API models that support multiple parallel function calls. This could include checking the weather in multiple cities or first finding the location of a hotel and then ...