How To SIMULATE A Function Calling Dataset!

hi everyone!

i want to share with you a little project i created a few months ago to solve a problem i was having with function calling. whenever i needed a good quality and specific dataset to train my models on function calling i couldn’t find a good repo for generation. i wanted a dataset that teaches the model not only how to call the tool but also when, in different contexts. i also wanted to have maniacal control on the results, i wanted to control how many tools in each convo, when the tool is called, errors in tool callings and in particular i wanted something that was flexible enought to include *PERSONALIZED* tools with personalized mock answers!!!

for example you can find some tools i made for the sample below in the repo under

synthfc/tools/eng

and

synthfc/tools/ita

i also wanted a way to check the results and auto-correct the pieces of data that have problems. here is the repo:

https://github.com/pierpierpy/FC-synth

here some examples i created with an open source model:

https://huggingface.co/datasets/pierjoe/function-calling-synthetic-2000

hope you find it useful!

happy tool calling!

submitted by /u/Logical_Delivery8331
[link] [comments]

Leave a Reply

Your email address will not be published. Required fields are marked *