Prototyping With AI Models
Prototyping With AI Models
In this article
Finding AI models
Experimenting with AI models in the playground
Going to production
Rate limits
Leaving feedback
If you want to develop a generative AI application, you can use GitHub Models to find and experiment
with AI models for free. Once you are ready to bring your application to production, you can switch to
a token from a paid Azure account. See the Azure AI documentation.
Finding AI models
To find an AI model:
1 Go to github.com/marketplace/models.
Alternatively, in the dropdown menu, click View all models, click a model in the Marketplace,
then click Playground.
The model is opened in the model playground. Details of the model are displayed in the sidebar on
the right. If the sidebar is not displayed, expand it by clicking the icon at the right of the
playground.
Note
Note
To adjust parameters for the model, in the playground, select the Parameters tab in the sidebar.
To see code that corresponds to the parameters that you selected, switch from the Chat tab to the
Code tab.
Comparing models
You can submit a prompt to two models at the same time and compare the responses.
With one model open in the playground, click Compare, then, in the dropdown menu, select a model
for comparison. The selected model opens in a second chat window. When you type a prompt in
either chat window, the prompt is mirrored to the other window. The prompts are submitted
simultaneously so that you can compare the responses from each model.
Note
GitHub provides free API usage so that you can experiment with AI models in your own application.
The steps to use each model are similar. In general, you will need to:
1 Go to github.com/marketplace/models.
All models can be used with the Azure AI Inference SDK, and some models support additional
SDKs. If you want to easily switch between models, you should select "Azure AI Inference SDK". If
you selected "REST" as the language, you won't use an SDK. Instead, you will use the API
endpoint directly.
To run in a codespace, click Run codespace, then click Create new codespace.
To run locally:
Create a GitHub personal access token. The token should not have any scopes or
permissions. See Managing your personal access tokens.
Save your token as an environment variable.
Install the dependencies for the SDK, if required.
The free API usage is rate limited. See Rate limits below.
To create a preset for your current context, select Preset: PRESET-NAME at the top right of the
playground, then click Create new preset. You need to name your preset, and you can also choose
to provide a preset description, include your chat history, and allow your preset to be shared.
Select the Preset: PRESET-NAME dropdown menu, then click the preset you want to load.
Open a shared preset URL
After you load a preset, you can edit, share, or delete the preset:
To edit the preset, change the parameters and prompt the model. Once you are satisfied with
your changes, select the Preset: PRESET-NAME dropdown menu, then click Edit preset and
save your updates.
To share the preset, select the Preset: PRESET-NAME dropdown menu, then click Share
preset to get a shareable URL.
To delete the preset, select the Preset: PRESET-NAME dropdown menu, then click Delete
preset and confirm the deletion.
Quickly test and refine prompts without the complexity of multi-turn interactions.
Fine-tune prompts for precision and relevance in your projects.
Use a specialized space for single-turn scenarios to ensure consistent and optimized results.
To access the prompt editor, click Prompt editor at the top right of the playground.
Note
The AI Toolkit extension for Visual Studio Code is in public preview and is subject to change.
If you prefer to experiment with AI models in your IDE, you can install the AI Toolkit extension for
Visual Studio Code, then test models with adjustable parameters and context.
1 In Visual Studio Code, install the pre-release version of the AI Toolkit for Visual Studio Code.
2 To open the extension, click the AI Toolkit icon in the activity bar.
4 In the "My models" section of the AI Toolkit panel, click Open Model Catalog, then find a model
to experiment with.
To use a model hosted remotely through GitHub Models, on the model card, click Try in
playground.
To download and use a model locally, on the model card, click Download. Once the
download is complete, on the same model card, click Load in playground.
5 In the sidebar, provide any context instructions and inference parameters for the model, then
send a prompt.
Going to production
The rate limits for the playground and free API usage are intended to help you experiment with
models and develop your AI application. Once you are ready to bring your application to production,
you can use a token from a paid Azure account instead of your GitHub personal access token. You
don't need to change anything else in your code.
Rate limits
The playground and free API usage are rate limited by requests per minute, requests per day, tokens
per request, and concurrent requests. If you get rate limited, you will need to wait for the rate limit
that you hit to reset before you can make more requests.
Low, high, and embedding models have different rate limits. To see which type of model you are
using, refer to the model's information in GitHub Marketplace.
Rate limit tier Rate limits Free and Copilot Copilot Business Copilot Enterprise
Individual
Tokens per request 8000 in, 4000 out 8000 in, 4000 out 8000 in, 8000 out
Concurrent requests 5 5 8
Tokens per request 8000 in, 4000 out 8000 in, 4000 out 16000 in, 8000 out
Concurrent requests 2 2 4
Concurrent requests 5 5 8
Tokens per request 4000 in, 4000 out 4000 in, 4000 out 4000 in, 8000 out
Concurrent requests 1 1 1
Tokens per request 4000 in, 4000 out 4000 in, 4000 out 4000 in, 4000 out
Concurrent requests 1 1 1
Leaving feedback
To leave feedback about GitHub Models, start a new discussion or comment on an existing discussion
in the GitHub Community.
Legal
© 2025 GitHub, Inc. Terms Privacy Status Pricing Expert services Blog