Supported Models#
Flywheel supports a variety of Transformer models. Below we provide the model architectures that are currently supported. Alongside each architecture, we include some popular models that use it and support for NVIDIA and AMD.
Architecture |
Models |
Example Models |
NVIDIA |
AMD |
---|---|---|---|---|
LlamaForCausalLM |
CodeLlama, |
code-llama/CodeLlama-7B, |
✅︎ |
✅︎ |
MistralForCausalLM |
Mistral |
mistralai/Mistral-7B, |
✅︎ |
✅︎ |
MixtralForCausalLM |
Mixtral |
mistralai/Mixtral-8x7B, |
✅︎ |
✅︎ |
GemmaForCausalLM |
Gemma 2 |
google/gemma2-27B |
✅︎ |
- |
Qwen2ForCausalLM |
Qwen1.5 |
Qwen/Qwen1.5-7B, |
✅︎ |
✅︎ |
MPTForCausalLM |
Mosaic, |
aisingapore/sea-lion-7B, |
✅︎ |
- |
DbrxForCausalLM |
dbrx |
databricks/dbrx-instruct |
- |
✅︎ |