manitcor@lemmy.intai.tech

manitcor@lemmy.intai.tech

Large language model evaluation and workflow framework from Phase AI. - GitHub - wgryc/phasellm: Large language model evaluation and workflow framework from Phase AI.

Docs: https://phasellm.com/docs/phasellm/eval.html

This project provides a unified framework to test generative language models on a large number of different evaluation tasks.

Features:

200+ tasks implemented. See the task-table for a complete list.
Support for models loaded via transformers (including quantization via AutoGPTQ), - GPT-NeoX, and Megatron-DeepSpeed, with a flexible tokenization-agnostic interface.
Support for commercial APIs including OpenAI, goose.ai, and TextSynth.
Support for evaluation on adapters (e.g. LoRa) supported in HuggingFace’s PEFT library.
Evaluating with publicly available prompts ensures reproducibility and comparability between papers.
Task versioning to ensure reproducibility when tasks are updated.

You must log in or register to comment.

Chat

Machine Learning - Learning/Language Models@lemmy.intai.tech

models@lemmy.intai.tech

Create a post

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Discussion of models, thier use, setup and options.

Please include models used with your outputs, workflows optional.

Model Catalog

We follow Lemmy’s code of conduct.

Communities

Useful links

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

1 user / day
1 user / week
1 user / month
5 users / 6 months
0 local subscribers
0 subscribers
50 Posts
1 Comment
Modlog

mods:
manitcor@lemmy.intai.tech

Warning: Some posts on this platform may contain adult material intended for mature audiences only. Viewer discretion is advised. By clicking ‘Continue’, you confirm that you are 18 years or older and consent to viewing explicit content.

GitHub - wgryc/phasellm: Large language model evaluation and workflow framework from Phase AI.

GitHub - wgryc/phasellm: Large language model evaluation and workflow framework from Phase AI.

Features: