Apply for a GPU community grant: Personal project

#1
by stanley-00 - opened

This is demo to evaluate small LLM model without needing for inference provider.

Hii !!
Thank You to put my model "Archaea-74M" On your inference engine.
I would suggest you to put a "token per sec" Metric in this inference engine , It would help to evalutate model speed further.

Thank You !!

Sign up or log in to comment