Wingman has a built-in configuration window that persists your settings locally to a file on your machine.
Settings can be accessed from the side panel, or under the Wingman option in the terminal pane.
The Wingman storage location is shown below:
Storage location: ~/.wingman (inside your home directory)
Example on macOS: /Users/username/.wingman
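As a minimal sketch of how that path resolves, assuming Node's built-in os and path modules (the settings file name below is hypothetical, for illustration only):

```typescript
import os from "node:os";
import path from "node:path";

// Wingman persists configuration under <home directory>/.wingman.
// "settings.json" is a hypothetical file name used for illustration.
const wingmanDir = path.join(os.homedir(), ".wingman");
const settingsFile = path.join(wingmanDir, "settings.json");

console.log(settingsFile); // e.g. /Users/username/.wingman/settings.json on macOS
```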
We aim to support the best models available. You can configure separate models for chat and code completion, which is especially helpful when running AI models locally. Here is a list of the models we support for each provider:
You can use the following models:
Sonnet 3.7 is an aggressive model. You can enable thinking mode (with a 2048-token budget) or spark mode, which removes the focus constraints Wingman places on 3.7; spark mode returns the raw model output without the prompt restricting it.
Anthropic prompt caching is used to reduce latency and cost.
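As a rough illustration of what thinking mode and prompt caching look like at the API level (a sketch using the @anthropic-ai/sdk package, not Wingman's actual request code; the system prompt is a placeholder):

```typescript
import Anthropic from "@anthropic-ai/sdk";

const client = new Anthropic(); // reads ANTHROPIC_API_KEY from the environment

const response = await client.messages.create({
  model: "claude-3-7-sonnet-latest",
  max_tokens: 4096, // must be larger than the thinking budget
  // Thinking mode: give the model an internal reasoning budget of 2048 tokens.
  thinking: { type: "enabled", budget_tokens: 2048 },
  system: [
    {
      type: "text",
      text: "You are a coding assistant...", // placeholder system prompt
      // Prompt caching: reuse this large prompt prefix across requests.
      cache_control: { type: "ephemeral" },
    },
  ],
  messages: [{ role: "user", content: "Explain this function." }],
});

console.log(response.content);
```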
NOTE - Unlike with Ollama, your data is not private and will not be sanitized before being sent to Anthropic.
You can use the following models:
NOTE - Unlike with Ollama, your data is not private and will not be sanitized before being sent to OpenAI.
NOTE - Azure AI applies content filters to models by default, which adds latency to responses; disabling the filters requires additional configuration.
You can use the following models:
NOTE - Unlike with Ollama, your data is not private and will not be sanitized before being sent to Azure OpenAI.
Wingman uses a full agentic flow, so you must use an Ollama model that supports tools (see the request sketch after the example below). If you would like a model added, please open an issue on GitHub.
NOTE - You can use any quantization of a supported model; you are not limited to a specific one.
Example: deepseek-coder:6.7b-instruct-q4_0
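To show what "supports tools" means in practice, here is a minimal request against Ollama's local chat API with one tool attached (the model tag and the read_file tool are illustrative, not Wingman's internals):

```typescript
// Minimal sketch of a tool-capable chat request to a local Ollama server.
const response = await fetch("http://localhost:11434/api/chat", {
  method: "POST",
  headers: { "Content-Type": "application/json" },
  body: JSON.stringify({
    model: "llama3.1:8b", // illustrative tag; use any tools-capable model
    stream: false,
    messages: [{ role: "user", content: "Summarize the README." }],
    tools: [
      {
        type: "function",
        function: {
          name: "read_file", // hypothetical tool for illustration
          description: "Read a file from the workspace",
          parameters: {
            type: "object",
            properties: { path: { type: "string" } },
            required: ["path"],
          },
        },
      },
    ],
  }),
});

const data = await response.json();
// A tools-capable model may answer with message.tool_calls instead of text.
console.log(data.message.tool_calls ?? data.message.content);
```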
Supported Models for Code Completion:
Supported Models for Chat:
NOTE - These are out of date.
Supported Models for Code Completion:
Supported Models for Chat:
NOTE - Unlike with Ollama, your data is not private and will not be sanitized before being sent to the provider.
Settings for the extension are broken down into four categories.
Provider settings include which model to use, the endpoint, and the API key. These are saved per provider, allowing you to switch providers on the fly.
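As a hedged illustration of what per-provider settings of this shape might look like (the field names below are hypothetical, not Wingman's actual schema):

```typescript
// Hypothetical per-provider settings shape; field names are illustrative.
interface ProviderSettings {
  chatModel: string; // model used for chat
  codeModel: string; // model used for code completion
  baseUrl: string;   // provider endpoint
  apiKey?: string;   // not needed for local providers such as Ollama
}

// Saving settings per provider lets you switch on the fly without re-entering them.
const providers: Record<string, ProviderSettings> = {
  Ollama: {
    chatModel: "llama3.1:8b",
    codeModel: "deepseek-coder:6.7b-instruct-q4_0",
    baseUrl: "http://localhost:11434",
  },
  Anthropic: {
    chatModel: "claude-3-7-sonnet-latest",
    codeModel: "claude-3-7-sonnet-latest",
    baseUrl: "https://api.anthropic.com",
    apiKey: process.env.ANTHROPIC_API_KEY,
  },
};
```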
General extension settings are persisted separately from the AI provider settings. Here is a breakdown of the general settings:
Code completion can run automatically, triggered by line returns, spaces, and tabs, or it can be invoked manually via the "Wingman: Code Complete" command hotkey.
This is an experimental version of code completion that attempts to return results faster, letting you see incremental changes as you accept them.
During code completion, this controls the amount of surrounding text passed to the AI provider, which improves completion results (a sketch of the idea follows).
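A rough sketch of the idea using the VS Code API, assuming a character-based window (Wingman's actual implementation may differ, e.g. counting tokens instead):

```typescript
import * as vscode from "vscode";

// Illustrative only: collect a capped window of text around the cursor
// to send to the AI provider as completion context.
function getSurroundingText(
  document: vscode.TextDocument,
  position: vscode.Position,
  contextWindow: number // maximum characters of surrounding text
): { prefix: string; suffix: string } {
  const start = new vscode.Position(0, 0);
  const end = document.lineAt(document.lineCount - 1).range.end;
  const fullPrefix = document.getText(new vscode.Range(start, position));
  const fullSuffix = document.getText(new vscode.Range(position, end));
  const half = Math.floor(contextWindow / 2);
  return {
    prefix: fullPrefix.slice(-half),   // text just before the cursor
    suffix: fullSuffix.slice(0, half), // text just after the cursor
  };
}
```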
The maximum number of tokens the code model can generate during code completion.
Controls the maximum number of tokens the AI provider will return.
Agent Features are general settings that shape the overall experience.
Automatically accept commands and proposed file changes. When off, you will be prompted to accept each command and file change - you can view the diff before accepting. If you reject, you will be asked what modifications you'd like to make.
Automatically fix linting/import errors for any file modified by Wingman.
Emits an audio cue when the session has completed; in manual mode, it also emits a cue for each command or file change you need to accept or reject.