Page History

Multi-Domain Expert Learning (M*DEL) is an approach to training LLMs for expertise in knowledge domains.

This process involves branching from a base model and training the branch on specific domain data in order to establish an expert layer, which routing logic is able to activate in serving inference requests. The result is a framework for domain expertise that is easily-extensible, modular, and efficient.

...

Info
To use example data for training, see the `axolotl-mdel/scripts/` directory`download-mtg.py`

Next, make a configuration file to use for this training run with a meaningful filename; here, it is called "experiment1":

...

Code Block

language	text

$ huggingface-cli upload [repo-id] [local-path] [path-in-repo] --token=[read/write tokentoken with WRITE permission]

For example:

Code Block

language	text

$ huggingface-cli upload stillerman/aurora-mathematica ./lora-out/ . --token=...

This uploads the model to the Hub at HuggingFace and is public, by default. Add the --private flag to make it private.

Page tree

Versions Compared

Old Version 4

New Version 5

Key