mirror of https://github.com/sourcegraph/sourcegraph.git synced 2026-02-06 16:51:55 +00:00

History

Stephen Gutekanst 544d261e66 replace modelOverridesRecommendedSettings with selfHostedModels (#64164 ) Previously, for providing self-hosted model' configuration (the models we've tested and believe work well), a site admin would use configuration like this: ``` "modelConfiguration": { ... "modelOverridesRecommendedSettings": [ "mistral::v1::mixtral-8x7b-instruct", "bigcode::v1::starcoder2-7b" ], } ``` A few problems with this: 1. If you are NOT self-hosting models, you probably really should not be using this option, as it would set `serverSideConfig` options specific to self-hosting, but it's naming "recommended settings" which kind of suggests otherwise! 2. When self-hosting models, there is almost a 1:1 correlation of `provider` to actual API endpoint (because you have a single endpoint per model) - so not being able to configure the `mistral` or `bigcode` parts of the modelref above is problematic (restricts you to hosting 'only one model per provider'). The only escape for this currently is to abandon the defaults we provide with `modelOverridesRecommendedSettings` and rewrite it using `modelOverrides` fully yourself. 3. When self-hosting models, needing to configure the `serverSideConfig.openaicompatible.apiModel` is a really common need - the most common option probably - but again there's no way to configure it here, only option is to abandon defaults and rewrite it yourself. 4. If we improve the default values - such as if we learn that a higher context window size for `mixtral-8x7b-instruct` is better - we currently don't have a good way to 'release a new version of the defaults' because the string is a model ref `mistral::v1::mixtral-8x7b-instruct` we'd have to do this by appending `-v2` to the model name or something. Having versioning here is important because there are both: * Breaking changes: if we increase the context window at all, site admins hosting these models may need to increase limits in their hosted model deployment - or else the API may just return a hard error ('you sent me too many tokens') * Non-breaking changes: if we _decrease_ the context window, Cody responses will get faster, and it's fine to do. Similarly, adding new stop sequences may be fine for example. This PR fixes all of these^ issues by deprecating `modelOverridesRecommendedSettings` and introducing a new `selfHostedModels` field which looks like: ``` "modelConfiguration": { ... "selfHostedModels": [ { "provider": "mistral", "model": "mixtral-8x7b-instruct@v1", "override": { "serverSideConfig": { "type": "openaicompatible", "apiModel": "mixtral-8x7b-instruct-custom!" } } }, { "provider": "bigcode", "model": "starcoder2-7b@v1", "override": { "serverSideConfig": { "type": "openaicompatible", "apiModel": "starcoder2-7b-custom!" } } } ], } ``` Notably: * The `provider` part of the model ref is now configurable, enabling self-hosting more than one model per provider while still benefitting from our default model configurations. * `"model": "starcoder2-7b@v1",` is no longer a model ref, but rather a 'default model configuration name' - and has a version associated with it. * `override` allows overriding properties of the default `"model": "starcoder2-7b@v1",` configuration, like the `serverSideConfig.apiModel`. ## Importance I'm hoping to ship this to a few customers asap; * Unblocks customer https://linear.app/sourcegraph/issue/PRIME-447 * Fixes https://linear.app/sourcegraph/issue/PRIME-454 (you can see some alternatives I considered here before settling on this approach.) ## Test plan Manually tested for now. Regression tests will come in the near future and are being tracked on Linear. ## Changelog Improved configuration functionality for Cody Enterprise with Self-hosted models. --------- Signed-off-by: Stephen Gutekanst <stephen@sourcegraph.com>		2024-07-30 20:41:23 -07:00
..
aws_codecommit.schema.json
azuredevops.schema.json	Docs: update links to point to new site (#60381 )	2024-02-13 00:23:47 +00:00
batch_spec.schema.json	batches: use "keyword" as default pattern type (#63613 )	2024-07-09 10:35:01 +02:00
bitbucket_cloud.schema.json	Add support for naming repo explicitly for Bitbucket Cloud (#61536 )	2024-04-08 19:03:53 +02:00
bitbucket_server_util.go	authz/github: validate provider against default github URL if not set (#24598 )	2021-09-06 12:37:33 -04:00
bitbucket_server.schema.json	fix(Source): Fix documentation URLs for code hosts help pages (#63274 )	2024-06-17 14:32:46 -04:00
bitbucketcloud_util.go	Add Bitbucket Cloud as an auth provider with Perms syncing (#46309 )	2023-01-16 14:20:35 +02:00
BUILD.bazel	schema: Remove unused extension schema file (#63657 )	2024-07-19 21:42:49 +02:00
changeset_spec.schema.json	code-search: handle changeset fork when creating a batch change via src-cli (#58156 )	2023-11-08 09:55:05 +01:00
gerrit.schema.json	gerrit: Add support for repositoryPathPattern (#64102 )	2024-07-26 15:08:14 +02:00
github_util.go	authz/github: validate provider against default github URL if not set (#24598 )	2021-09-06 12:37:33 -04:00
github.schema.json	dotcom: Remove on-demand cloning of repositories (#63321 )	2024-06-26 14:53:14 -07:00
gitlab_util.go	authz/github: validate provider against default github URL if not set (#24598 )	2021-09-06 12:37:33 -04:00
gitlab.schema.json	chore: Move authn into cmd/frontend (#63648 )	2024-07-31 03:26:25 +02:00
gitolite.schema.json	Unremoving phabricator integration fields, adding lines to changelog (#32573 )	2022-03-15 10:01:39 -04:00
go-modules.schema.json	extsvc: Change default rate limits of npm and Go external services (#34042 )	2022-04-19 11:50:46 +00:00
json-schema-draft-07.schema.json
jvm-packages.schema.json	packages: improve and expand docs (#49774 )	2023-03-21 17:47:57 +00:00
npm-packages.schema.json	npm: Bump rate limit. (#37018 )	2022-06-10 15:00:51 +00:00
onboardingtour.schema.json	user onboarding: Use server side configuration and improve admin experience (#56768 )	2023-09-19 22:10:45 +02:00
opencodegraph-protocol.schema.json	OpenCodeGraph prototype (#58675 )	2023-12-06 21:39:33 -08:00
opencodegraph.schema.json	OpenCodeGraph prototype (#58675 )	2023-12-06 21:39:33 -08:00
other_external_service.schema.json	Remove App from codebase (#59115 )	2023-12-21 01:07:05 +01:00
package.json	web: sync TS project refenreces (#46407 )	2023-01-16 18:55:10 -08:00
pagure.schema.json	repos: add Pagure code host support (#28084 )	2021-11-23 18:03:35 +01:00
perforce.schema.json	Remove unused rateLimit on perforce connections (#58188 )	2023-11-15 03:27:14 +01:00
phabricator.schema.json
python-packages.schema.json	repos: Introduce Python dependency repos integration (#34886 )	2022-05-05 13:24:25 +02:00
README.md	site-config: Make symbols not required in syntaxHighlighting (#57276 )	2023-10-16 19:53:19 -04:00
ruby-packages.schema.json	Packages: add RubyGems support (#42817 )	2022-10-17 09:48:18 +02:00
rust-packages.schema.json	Remove experimental indexRepositoryName for rust packages (#59176 )	2024-01-08 17:42:36 +01:00
schema.go	replace modelOverridesRecommendedSettings with selfHostedModels (#64164 )	2024-07-30 20:41:23 -07:00
settings.schema.json	various improvements to saved searches (#63539 )	2024-07-15 20:12:34 +00:00
site.schema.json	replace modelOverridesRecommendedSettings with selfHostedModels (#64164 )	2024-07-30 20:41:23 -07:00
stringdata.go	Remove App from codebase (#59115 )	2023-12-21 01:07:05 +01:00
tsconfig.json	web: fix pnpm-lock issue (#47478 )	2023-02-09 22:04:31 -08:00
validation_test.go	schema: remove non-determinism from TestSchemaValidationUUID (#61728 )	2024-04-09 15:50:30 +00:00

README.md

Sourcegraph JSON Schemas

JSON Schema is a way to define the structure of a JSON document. It enables typechecking and code intelligence on JSON documents.

Sourcegraph uses the following JSON Schemas:

Modifying a schema

Edit the *.schema.json file in this directory.
Run bazel run //schema:write_generated_schema.
Commit the changes to both files.
Run sg start to automatically update TypeScript schema files.

Known issues

The JSON Schema IDs (URIs) are of the form https://sourcegraph.com/v1/*.schema.json#, but these are not actually valid URLs. This means you generally need to supply them to JSON Schema validation libraries manually instead of having the validator fetch the schema from the web.