NVIDIA
NVIDIA
Finetune Mistral 7B Using Brev.dev Quick Deploy
Resource
NVIDIA
NVIDIA
Finetune Mistral 7B Using Brev.dev Quick Deploy

In this notebook, we will use NVIDIA's NeMo framework to finetune the Mistral 7B LLM. Finetuning can be done using Brev quick deploy option.

Finetune Mistral 7B using NVIDIA NeMo and PEFT

In this notebook, we will use NVIDIA's NeMo framework to finetune the Mistral 7B LLM. Finetuning is the process of adjusting the weights of a pre-trained foundation model with custom data. Considering that foundation models can be significantly large, a variant of fine-tuning has gained traction recently, known as parameter-efficient fine-tuning (PEFT). PEFT encompasses several methods, including P-Tuning, LoRA, Adapters, and IA3. For those interested in a deeper understanding of these methods, we have included a list of additional resources below.

Deploy now

To streamline your experience and jump directly into a GPU-accelerated environment with this notebook and NeMo pre-installed, click the badge below. Our 1-click deploys are powered by Brev.dev.

 Click here to deploy.

Getting started

Use the 1-click deploy link above to set up a machine with NeMo installed. Once the VM is ready, use the Access Notebook button to enter the Jupyter Lab instance

Model

For this notebook, we use the Mistral-7B parameter model and the NeMo framework. We will be finetuning on the PubMedQA dataset and training our model to respond with simple yes/no answer. PubMedQA is a novel biomedical question answering (QA) dataset collected from PubMed abstracts.

NeMo

NVIDIA NeMo framework is a generative AI framework built for researchers and pytorch developers working on large language models (LLMs), multimodal models (MM), automatic speech recognition (ASR), and text-to-speech synthesis (TTS). NeMo provides a scalable framework to easily design, implement, and scale new AI models using existing pre-trained models and a simple API for configuration.

Publisher
NVIDIA
NVIDIA
Latest Version1
UpdatedAugust 13, 2024 UTC
Compressed Size4.34 KB

NVIDIA uses cookies to improve your experience on our web site. We and our third-party partners also use cookies and other tools to collect and record information you provide as well as information about your interactions with our websites for performance improvement, analytics, and to assist in marketing efforts. By clicking "Accept All", you consent to our use of cookies and other tools as described in our Cookie Policy. You can manage your cookie settings by clicking on "Manage Settings." By continuing to use this site or by clicking one of the buttons below, you agree to our Terms of Service (which contains important waivers). Please see our Privacy Policy for more information on our privacy practices.