Apply for a GPU community grant: Personal project
#1
by stanley-00 - opened
This is a demo for evaluating small LLMs without needing an inference provider.
Hi!
Thank you for putting my model "Archaea-74M" on your inference engine.
I would suggest adding a "tokens per second" metric to this inference engine. It would help evaluate model speed further.
Thank you!
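To make the tokens-per-second suggestion concrete, here is a minimal sketch of how such a metric could be computed: the number of newly generated tokens divided by the wall-clock time of the generation call. The names `tokens_per_second`, `measure_generation`, and `fake_generate` are hypothetical and not part of this Space's code; `fake_generate` only stands in for a real model call.

```python
import time
from typing import Callable, List, Sequence


def tokens_per_second(num_new_tokens: int, elapsed_seconds: float) -> float:
    """Throughput as generated tokens divided by elapsed wall-clock time."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return num_new_tokens / elapsed_seconds


def measure_generation(generate: Callable[[str], Sequence[int]], prompt: str) -> float:
    """Time one call to `generate` and return its tokens-per-second rate.

    `generate` is assumed to return only the newly generated token ids,
    not the prompt tokens.
    """
    start = time.perf_counter()
    new_tokens = generate(prompt)
    elapsed = time.perf_counter() - start
    return tokens_per_second(len(new_tokens), elapsed)


def fake_generate(prompt: str) -> List[int]:
    """Hypothetical stand-in for a model: waits briefly, returns 20 token ids."""
    time.sleep(0.05)
    return list(range(20))


if __name__ == "__main__":
    rate = measure_generation(fake_generate, "Hello")
    print(f"{rate:.1f} tokens/sec")
```

With a real model, the count should cover only the generated tokens, and the first call is often slower because of warm-up, so averaging over a few runs may give a more stable number.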