Merge pull request #52 from microsoft/mbc
Model-based clustering and Arrow
matheper authored Jun 27, 2024
2 parents d00ff76 + db95d15 commit 6610f76
Showing 286 changed files with 32,008 additions and 7,499 deletions.
2 changes: 1 addition & 1 deletion .github/workflows/tests.yml
@@ -24,7 +24,7 @@ jobs:
- name: Set up python
uses: actions/setup-python@v5
with:
python-version: '3.8'
python-version: '3.11'
cache: 'pip'
- name: Install dependencies
run: |
28 changes: 26 additions & 2 deletions .gitignore
@@ -1,5 +1,8 @@
# Project related stuff
*.yaml
amlt/
dataloader/*_data/processed/
amlt*/
amulet_*/
wandb
.amltconfig
@@ -140,6 +143,27 @@ venv.bak/
.mypy_cache/
.dmypy.json
dmypy.json

# Pyre type checker
.pyre/
.pyre/
**/eval/alpaca_eval/*
**/eval/alpaca_eval/**/*.csv
**/eval/alpaca_eval/**/*.json
**/eval/mmlu/**/*.csv
results/
lm_cache/
/tmp/

#
projects/instr_routing/*

# any file starting with _
_*.*
_*/*
# any folder starting with _
_*/
!__init__.py
!*/__init__.py
*/my_configs/*
my_configs
gcr_configs
5 changes: 5 additions & 0 deletions .pre-commit-config.yaml
@@ -0,0 +1,5 @@
repos:
- repo: https://github.com/psf/black
rev: 24.4.2
hooks:
- id: black
10 changes: 10 additions & 0 deletions Dockerfile
@@ -0,0 +1,10 @@
FROM nvcr.io/nvidia/pytorch:23.04-py3

RUN apt update && apt install -y python3.8-venv git
ENV VIRTUAL_ENV=/opt/venv
RUN python3 -m venv $VIRTUAL_ENV
ENV PATH="$VIRTUAL_ENV/bin:$PATH"

RUN python -m pip install --upgrade pip
COPY requirements.txt .
RUN pip install -r requirements.txt
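A typical way to use this Dockerfile is sketched below; the image tag `mttl` is arbitrary, and `--gpus all` assumes the NVIDIA container toolkit is installed on the host:

```shell
# Build the image from the repository root, then open a shell
# in a GPU-enabled, auto-removing container.
docker build -t mttl .
docker run --gpus all -it --rm mttl bash
```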
81 changes: 18 additions & 63 deletions README.md
@@ -1,82 +1,37 @@
# MTTL
[![Tests](https://github.com/microsoft/mttl/actions/workflows/tests.yml/badge.svg)](https://github.com/microsoft/mttl/actions/workflows/tests.yml)

MTTL - Multi-Task Transfer Learning

## Setup
# MTTL - Multi-Task Transfer Learning

MTTL supports `Python 3.8` and `Python 3.9`. It is recommended to create a virtual environment for MTTL using `virtualenv` or `conda`. For example, with `conda`:
MTTL is a repository focusing on multi-task Natural Language Processing (NLP) methods that emphasize model reusability, model recombination, and parameter-efficient fine-tuning (PEFT) techniques, particularly in the context of few-shot and zero-shot learning.

conda create -n mttl python=3.9
conda activate mttl
Check out our papers on ArXiv:

Install the required Python packages:
- [Towards Modular LLMs by Building and Reusing a Library of LoRAs](https://arxiv.org/abs/2405.11157)
- [Multi-Head Adapter Routing for Cross-Task Generalization](https://arxiv.org/abs/2211.03831)

pip install -e .

Refer to the [Expert Library README](projects/modular_llm/README.md) for more details on training and evaluating experts.

For the code that accompanies the paper [Multi-Head Adapter Routing for Cross-Task Generalization](https://arxiv.org/abs/2211.03831), please refer to [MHR-camera-ready](https://github.com/microsoft/mttl/tree/mhr-camera-ready).

## Multi-Head Adapter Routing

Please ensure that you have navigated to the `projects/mhr` directory before running the Multi-Head Adapter Routing scripts:
## Intended uses

cd projects/mhr
MTTL is intended for research use as described in the paper [Towards Modular LLMs by Building and Reusing a Library of LoRAs](https://arxiv.org/abs/2405.11157). MTTL's performance in production environments has not been tested; considerable testing and verification are needed before the concepts and code shared here are used in production environments.

## Evaluations

### Data Preparation
MTTL was evaluated on a selected set of standard NLP tasks, mostly on English data. Among these tasks are common-sense reasoning, question answering, and coding. The evaluation focused on zero-shot performance, supervised adaptation, and the effectiveness of different routing strategies and library constructions using models such as Phi-2 and Mistral. Complete details on evaluations can be found in the paper.

Download and prepare the datasets for the experiments using the following script:
## Limitations

bash datasets/create_datasets.sh
MTTL is built on top of existing language models and LoRAs, and is likely to inherit any biases, risks, or limitations of these constituent parts. For example, LLMs may inadvertently propagate biases present in their training data or produce harmful or inaccurate content. MTTL has been tested on English tasks and has not yet been evaluated in multilingual scenarios, so performance on multilingual or non-English tasks is not yet known. Likewise, because MTTL was evaluated on a selected set of standard NLP tasks, performance on tasks outside of those covered in the paper is not yet known.

## Safe and responsible use

### Environment Variables
Given that MTTL is used with LoRAs chosen or built by the user, it’s important for users to fully understand the behavior and safety of the LoRAs that they use. Users should verify both the accuracy and the safety for their specific configuration and scenario.

Based on your experiments, you may need to export one or more of the following environment variables:

T0_DATA_DIR: `data/t0_data/processed` if you ran the `create_datasets.sh`
NI_DATA_DIR: `data/ni_data/processed` if you ran the `create_datasets.sh`
XFIT_DATA_DIR: `data/ni_data/processed` if you ran the `create_datasets.sh`
CHECKPOINT_DIR
OUTPUT_DIR
CACHE_DIR
STORYCLOZE_DIR: path to your downloaded `.csv` files. See [the storycloze official website](https://cs.rochester.edu/nlp/rocstories/)
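As a sketch, the exports might look like the following, assuming the default paths produced by `create_datasets.sh`; the checkpoint, output, cache, and storycloze locations are placeholders to adapt to your setup:

```shell
# Dataset paths below assume you ran create_datasets.sh; adjust to your layout.
export T0_DATA_DIR=data/t0_data/processed
export NI_DATA_DIR=data/ni_data/processed
export XFIT_DATA_DIR=data/ni_data/processed
export CHECKPOINT_DIR=checkpoints          # placeholder
export OUTPUT_DIR=output                   # placeholder
export CACHE_DIR=cache                     # placeholder
export STORYCLOZE_DIR=/path/to/storycloze  # directory with the downloaded .csv files
```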


### Multi-task Pre-training

The general command for pre-training a model is:

python pl_train.py -c $CONFIG_FILES -k $KWARGS

Multiple `CONFIG_FILES` can be concatenated as `file1+file2`. To modify defaults, `KWARGS` can be expressed as `key=value`.
You can check [scripts/pretrain](scripts/pretrain) for examples.
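For example, a hypothetical invocation combining two config files and overriding two defaults might look like this (the config file names and keys are illustrative, not taken from the repo):

```shell
# Configs are concatenated with '+'; '-k key=value' pairs override their defaults.
python pl_train.py -c configs/t0_base.json+configs/lora.json \
    -k learning_rate=1e-4 train_batch_size=8
```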

### Test Fine-Tuning

To perform finetuning for a test task, use the script `pl_finetune.py`

### Hyper-parameter Search for Test Fine-Tuning

To perform a hyperparameter search for a test task, use the script `pl_finetune_tune.py`.
The script calls the functions in `pl_finetune.py` in a loop, and itself defines the hyperparameter ranges for the different fine-tuning types.


### Pre-Configured Scripts

Alternatively, you can run the pre-configured scripts from the `scripts` folder. For example:

bash scripts/mhr_pretrain.sh

### Known Issues
If you run into the protoc error `TypeError: Descriptors cannot not be created directly.`, try downgrading protobuf to 3.20.*:

pip install protobuf==3.20.*


## Running Tests

pip install -e ".[test]"
pytest -vv tests


## Contributing
@@ -96,8 +51,8 @@ contact [opencode@microsoft.com](mailto:opencode@microsoft.com) with any additio

## Trademarks

This project may contain trademarks or logos for projects, products, or services. Authorized use of Microsoft
trademarks or logos is subject to and must follow
[Microsoft's Trademark & Brand Guidelines](https://www.microsoft.com/en-us/legal/intellectualproperty/trademarks/usage/general).
Use of Microsoft trademarks or logos in modified versions of this project must not cause confusion or imply Microsoft sponsorship.
Any use of third-party trademarks or logos is subject to those third parties' policies.
