Previously, to provide self-hosted model configuration (the models
we've tested and believe work well), a site admin would use
configuration like this:
```
"modelConfiguration": {
...
"modelOverridesRecommendedSettings": [
"mistral::v1::mixtral-8x7b-instruct",
"bigcode::v1::starcoder2-7b"
],
}
```
A few problems with this:
1. If you are NOT self-hosting models, you probably should not be
using this option, as it sets `serverSideConfig` options specific
to self-hosting - but its name, "recommended settings", kind of
suggests otherwise!
2. When self-hosting models, there is almost a 1:1 correlation of
`provider` to actual API endpoint (because you have a single endpoint
per model) - so not being able to configure the `mistral` or `bigcode`
parts of the model ref above is problematic (it restricts you to hosting
only one model per provider). The only escape currently is to
abandon the defaults we provide with `modelOverridesRecommendedSettings`
and rewrite the configuration fully yourself using `modelOverrides`.
3. When self-hosting models, configuring
`serverSideConfig.openaicompatible.apiModel` is a really common need -
probably the most common - but again there's no way to configure
it here; the only option is to abandon the defaults and rewrite them yourself.
4. If we improve the default values - such as if we learn that a higher
context window size for `mixtral-8x7b-instruct` is better - we currently
don't have a good way to 'release a new version of the defaults':
because the string is a model ref (`mistral::v1::mixtral-8x7b-instruct`),
we'd have to do this by appending `-v2` to the model name or something.
Versioning here is important because there are both:
* Breaking changes: if we increase the context window at all, site
admins hosting these models may need to increase limits in their hosted
model deployment - or else the API may just return a hard error ('you
sent me too many tokens').
* Non-breaking changes: if we _decrease_ the context window, Cody
responses will get faster, and that's fine to do. Similarly, adding new
stop sequences may be fine, for example.
This PR fixes all of these issues by deprecating
`modelOverridesRecommendedSettings` and introducing a new
`selfHostedModels` field which looks like:
```
"modelConfiguration": {
...
"selfHostedModels": [
{
"provider": "mistral",
"model": "mixtral-8x7b-instruct@v1",
"override": {
"serverSideConfig": {
"type": "openaicompatible",
"apiModel": "mixtral-8x7b-instruct-custom!"
}
}
},
{
"provider": "bigcode",
"model": "starcoder2-7b@v1",
"override": {
"serverSideConfig": {
"type": "openaicompatible",
"apiModel": "starcoder2-7b-custom!"
}
}
}
],
}
```
Notably:
* The `provider` part of the model ref is now configurable, enabling
self-hosting of more than one model per provider while still benefiting
from our default model configurations.
* `"model": "starcoder2-7b@v1"` is no longer a model ref, but rather a
'default model configuration name' - and it has a version associated
with it.
* `override` allows overriding properties of the default
`starcoder2-7b@v1` configuration, like `serverSideConfig.apiModel`.
## Importance
I'm hoping to ship this to a few customers ASAP:
* Unblocks customer https://linear.app/sourcegraph/issue/PRIME-447
* Fixes https://linear.app/sourcegraph/issue/PRIME-454 (you can see some
alternatives I considered here before settling on this approach.)
## Test plan
Manually tested for now. Regression tests will come in the near future
and are being tracked on Linear.
## Changelog
Improved configuration functionality for Cody Enterprise with
Self-hosted models.
---------
Signed-off-by: Stephen Gutekanst <stephen@sourcegraph.com>
They should not be used outside of `cmd/frontend`, so this change makes
them a frontend-internal package.
While doing that, I realized that there is a coupling dependency between
authz providers and auth (which is authN) providers: GitLab code host
connections can do authz mapping via the usernames of another OIDC or
SAML auth provider
(https://sourcegraph.com/docs/admin/code_hosts/gitlab#administrator-sudo-level-access-token).
It turns out this feature has not worked for at least several
releases, because we don't actually instantiate auth providers outside
of `cmd/frontend`, so the mapping will never find anything (auth
providers don't explode when queried before init, unlike authz).
This only now became clear as I moved this code, and the dependency
graph was broken, so that's a nice property of these cleanups I guess 😬
Since it hasn't worked for quite some time, I opted to remove
it, and added a changelog entry about it. Not sure if that is
sufficient; I raised a thread here:
https://sourcegraph.slack.com/archives/C03K05FCRFH/p1721848436473209.
Keeping the feature would have prevented this change and required more
refactoring, as unfortunately we cannot map an auth provider by the conf
type to a record in the `user_external_accounts` table - we need to
actually instantiate it.
Test plan: Compiler doesn't complain, tests still pass.
## Changelog
GitLab code host connections were [able to sync permissions by mapping
Sourcegraph users to GitLab users via the username property of an
external OIDC or SAML
provider](https://sourcegraph.com/docs/admin/code_hosts/gitlab#administrator-sudo-level-access-token)
that is shared across Sourcegraph and GitLab. This integration stopped
working a long time ago, and it has been removed in this release.
Previously, we would store authz providers globally and refresh them
every now and then.
However, creating the providers is fairly cheap (1.3ms in a local trace),
so we should not keep them in memory - nor should we have to remember to
start the watcher routine.
This will help for multi-tenant Sourcegraph in that providers are now
computed for the context in question, and not held globally. Keeping
potentially 100k authz providers in memory will not scale.
Test plan: Still works, local Jaeger traces are quite acceptable.
Support detecting `search` and `edit` intents - return additional scores
for those two categories.
## Test plan
- tested locally -> use the following GraphQL payload:
```
{
  chatIntent(query: "yo", interactionId: "123") {
    intent
    score
    searchScore
    editScore
  }
}
```
As the next release of Sourcegraph approaches, and we get our ducks in a
row for rolling out the modelconfig changes, we have cut the ability for
the Sourcegraph backend to poll Cody Gateway for new LLM models.
Instead, if so configured, the only "Sourcegraph-supplied models" will
be those embedded into the binary at build time. (See
`internal/modelconfig/embedded`.)
This PR removes any externally facing references to this capability.
Since the capability wasn't actually implemented yet, this doesn't
change any functionality.
We'll add this capability in the next release, ~September. See
[PRIME-290](https://linear.app/sourcegraph/issue/PRIME-290/feature-sourcegraph-instances-can-automatically-pick-up-new-llms).
## Test plan
NA
## Changelog
NA
The schema file was removed long ago, this removes the Go code for it as
well, as there are no more references to it.
Test plan:
Go compiler doesn't complain about missing symbols.
This PR is stacked on top of all the prior work @chrsmith has done for
shuffling configuration data around; it implements the new "Self hosted
models" functionality.
## Configuration
Configuring a Sourcegraph instance to use self-hosted models basically
involves adding some configuration like this to the site config (if you
set `modelConfiguration`, you are opting in to the new system which is
in early access):
```
// Setting this field means we are opting into the new Cody model configuration system.
"modelConfiguration": {
  // Disable use of Sourcegraph's servers for model discovery
  "sourcegraph": null,
  // Create two model providers
  "providerOverrides": [
    {
      // Our first model provider "mistral" will be a Huggingface TGI deployment which hosts our
      // mistral model for chat functionality.
      "id": "mistral",
      "displayName": "Mistral",
      "serverSideConfig": {
        "type": "huggingface-tgi",
        "endpoints": [{"url": "https://mistral.example.com/v1"}]
      }
    },
    {
      // Our second model provider "bigcode" will be a Huggingface TGI deployment which hosts our
      // bigcode/starcoder model for code completion functionality.
      "id": "bigcode",
      "displayName": "Bigcode",
      "serverSideConfig": {
        "type": "huggingface-tgi",
        "endpoints": [{"url": "http://starcoder.example.com/v1"}]
      }
    }
  ],
  // Make these two models available to Cody users
  "modelOverridesRecommendedSettings": [
    "mistral::v1::mixtral-8x7b-instruct",
    "bigcode::v1::starcoder2-7b"
  ],
  // Configure which models Cody will use by default
  "defaultModels": {
    "chat": "mistral::v1::mixtral-8x7b-instruct",
    "fastChat": "mistral::v1::mixtral-8x7b-instruct",
    "codeCompletion": "bigcode::v1::starcoder2-7b"
  }
}
```
More advanced configurations are possible; the above is our blessed
configuration for today.
## Hosting models
Another major component of this work is starting to build up
recommendations around how to self-host models, which ones to use, how
to configure them, etc.
For now, we've been testing with these two on a machine with dual A100s:
* Huggingface TGI (a Docker container for model inference which
provides an OpenAI-compatible API, and is widely popular)
* Two models:
  * Starcoder2 for code completion; specifically `bigcode/starcoder2-15b`
with `eetq` 8-bit quantization.
  * Mixtral 8x7b instruct for chat; specifically
`casperhansen/mixtral-instruct-awq`, which uses `awq` 4-bit quantization.
This is our 'starter' configuration. Other models - specifically other
Starcoder2 and Mixtral instruct variants - certainly work too, and
higher-parameter versions may of course provide better results.
Documentation for how to deploy Huggingface TGI, suggested configuration,
and debugging tips are coming soon.
## Advanced configuration
As part of this effort, I have added a quite extensive set of
configuration knobs to the client-side model configuration (see `type
ClientSideModelConfigOpenAICompatible` in this PR).
Some of these configuration options are needed for things to work at a
basic level, while others (e.g. prompt customization) are not needed for
basic functionality, but are very important for customers interested in
self-hosting their own models.
Today, Cody clients have a number of different _autocomplete provider
implementations_ which tie the model-specific logic needed for
autocomplete to a provider. For example, if you use a GPT model through
Azure OpenAI, the autocomplete provider for that is entirely different
from what you'd get if you used a GPT model through OpenAI officially.
This can lead to subtle issues for us, so it is worth exploring a
_generalized autocomplete provider_ - and since we _must_ address this
problem for self-hosted models, these configuration knobs fed from the
server to the client are a pathway to doing that: initially just for
self-hosted models, but in the future possibly generalized to other
providers.
## Debugging facilities
Working with customers in the past to use OpenAI-compatible APIs, we've
learned that debugging can be quite a pain: if you can't see what
requests the Sourcegraph backend is making and what it is getting back,
it is very hard to debug.
This PR implements quite extensive logging, and a `debugConnections`
flag which can be turned on to enable logging of the actual request
payloads and responses. This is critical when a customer is trying to
add support for a new model, their own custom OpenAI API service, etc.
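As a rough illustration (hypothetical names, not the exact code in this
PR), the `debugConnections` behavior amounts to dumping the raw HTTP
exchange around each upstream call:
```go
package main

import (
	"log"
	"net/http"
	"net/http/httputil"
)

// doDebuggableRequest sketches the idea behind debugConnections: when the flag
// is on, dump the full outgoing request and the response headers so an admin
// can see exactly what the backend sent and received. (The response body is
// not dumped here since it is a stream the completion code must consume.)
func doDebuggableRequest(client *http.Client, req *http.Request, debugConnections bool) (*http.Response, error) {
	if debugConnections {
		if dump, err := httputil.DumpRequestOut(req, true); err == nil {
			log.Printf("openaicompatible: request:\n%s", dump)
		}
	}
	resp, err := client.Do(req)
	if err != nil {
		return nil, err
	}
	if debugConnections {
		if dump, err := httputil.DumpResponse(resp, false); err == nil {
			log.Printf("openaicompatible: response:\n%s", dump)
		}
	}
	return resp, nil
}
```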
## Robustness
Working with customers in the past, we also learned that various parts
of our backend `openai` provider were not super robust. For example, [if
more than one message was present it was a fatal
error](https://github.com/sourcegraph/sourcegraph/blob/main/internal/completions/client/openai/openai.go#L305),
and if the SSE stream yielded `{"error"}` payloads, they were silently
ignored. Similarly, the SSE event stream parser we use is heavily
tailored towards [the exact response
structure](https://github.com/sourcegraph/sourcegraph/blob/main/internal/completions/client/openai/decoder.go#L15-L19)
which OpenAI's official API returns, and is therefore quite brittle when
connected to a different SSE stream.
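For instance, a robust client has to check every streamed payload for an
error object before treating it as a completion chunk - a conceptual
sketch (hypothetical payload shape, not the PR's actual code):
```go
package main

import (
	"encoding/json"
	"fmt"
)

// handleSSEData checks a single SSE data payload for an upstream error object
// before extracting completion text, rather than silently ignoring errors.
func handleSSEData(data []byte) (string, error) {
	var payload struct {
		Error *struct {
			Message string `json:"message"`
		} `json:"error"`
		Choices []struct {
			Text string `json:"text"`
		} `json:"choices"`
	}
	if err := json.Unmarshal(data, &payload); err != nil {
		return "", fmt.Errorf("malformed SSE payload: %w", err)
	}
	if payload.Error != nil {
		// Surface the upstream error instead of dropping it.
		return "", fmt.Errorf("upstream error: %s", payload.Error.Message)
	}
	if len(payload.Choices) > 0 {
		return payload.Choices[0].Text, nil
	}
	return "", nil
}
```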
For this work, I have _started by forking_ our
`internal/completions/client/openai` - and made a number of major
improvements to it to make it more robust, handle errors better, etc.
I have also replaced the usage of a custom SSE event stream parser -
which was not spec-compliant and was brittle - with a proper SSE event
stream parser that recently popped up in the Go community:
https://github.com/tmaxmax/go-sse
My intention is that after more extensive testing, this new
`internal/completions/client/openaicompatible` provider will be more
robust, more correct, and all around better than
`internal/completions/client/openai` (and possibly the azure one) so
that we can just supersede those with this new `openaicompatible` one
entirely.
## Client implementation
Much of the work done in this PR is just "let the site admin configure
things, and broadcast that config to the client through the new model
config system."
Actually getting the clients to respect the new configuration is a task
I am tackling in future `sourcegraph/cody` PRs.
## Test plan
1. This change currently lacks any unit/regression tests; that is a
major noteworthy point. I will follow up with those in a future PR.
* However, these changes are **incredibly** isolated, clearly only
affecting customers who opt in to this new self-hosted models
configuration.
* Most of the heavy lifting (SSE streaming, shuffling data around) is
done in other well-tested codebases.
2. Manual testing has played a big role here, specifically:
* Running a dev instance with the new configuration, actually connected
to Huggingface TGI deployed on a remote server.
* Using the new `debugConnections` mechanism (which customers would use)
to directly confirm requests are going to the right places, with the
right data and payloads.
* Confirming with a new client (changes not yet landed) that
autocomplete and chat functionality work.
Can we use more testing? Hell yeah, and I'm going to add it soon. Does
it work quite well, with little room for error? Also yes.
## Changelog
Cody Enterprise: added a new configuration for self-hosting models.
Reach out to support if you would like to use this feature as it is in
early access.
---------
Signed-off-by: Stephen Gutekanst <stephen@sourcegraph.com>
Adds site-config configuration for RFC 969 intent detection, making the
Intent Detection API endpoint and token configurable without code
changes. Additionally, adds an option to hit multiple intent detection
backends with the same query.
Previously, the URL was hardcoded, so if the backend changed, we had to
redeploy sourcegraph.com.
As we iterate on intent detection, we want to be able to test multiple
models in parallel, so this PR adds a setting for `extra` backends - if
provided, additional .com -> backend requests will be sent, but the
client-initiated request will not wait for those requests.
Closes AI-128.
## Test plan
- tested locally - add
```
"cody.serverSideContext": {
"intentDetectionAPI": {
"default": {
"url": "http://35.188.42.13:8000/predict/linearv2"
},
"extra": [
{
"url": "http://35.188.42.13:8000/predict/linearv2"
}
]
}
}
```
to `experimentalFeatures` in dev-private.
- Remove long-deprecated and long-ineffective notifications for saved
searches (removed in
de8ae5ee28
2.5 years ago). Note that code monitors were the replacement for saved
searches and work great.
- Clean up UI.
- Make the UI global instead of in the user/org area.
- Convert React class components to function components.
- Add default `patterntype:` because it's required.
- Use `useQuery` and `useMutation` instead of `requestGraphQL`.
- Use a single namespace `owner` GraphQL arg instead of separating out
`userID` and `orgID`.
- Clean up GraphQL resolver code and factor out common auth checking.
- Support transferring ownership of saved searches among owners (the
user's own user account and the orgs they're a member of).
(I know this is not in Svelte.)
SECURITY: There is one substantive change. Site admins may now view any
user's and any org's saved searches. This is so that they can audit and
delete them if needed.
## Test plan
Try creating, updating, and deleting saved searches, and transferring
ownership of them.
## Changelog
- Improved the saved searches feature, which lets you save search
queries to easily reuse them later and share them with other people in
an organization.
- Added the ability to transfer ownership of a saved search to a user's
organizations or from an organization to a user's account.
- Removed the long-deprecated and ineffective `search.savedQueries`
settings field. You can manage saved searches in a user's
or organization's profile area (e.g., at `/user/searches`).
This is part of the Keyword GA Project.
Batch Changes uses Sourcegraph queries to define the list of repositories on which the batch change will run.
With this change we default to pattern type "keyword" instead of "standard".
To make this a backward compatible change, we also introduce a version identifier to batch specs. Authors can specify `version: 2` in the spec, in which case we default to pattern type "keyword". Existing specs (without a specified version) and specs with `version: 1` will keep using pattern type "standard".
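Conceptually, the defaulting rule is just a switch on the spec version -
a sketch with illustrative names, not the actual batch spec code:
```go
package main

import "fmt"

// defaultPatternType maps a batch spec version to the search pattern type used
// for "on:" repository queries. Version 0 stands for specs with no version set.
func defaultPatternType(specVersion int) (string, error) {
	switch specVersion {
	case 0, 1: // existing specs and explicit `version: 1` keep the old behavior
		return "standard", nil
	case 2:
		return "keyword", nil
	default:
		return "", fmt.Errorf("unsupported batch spec version: %d", specVersion)
	}
}
```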
Notes:
- Corresponding doc update [PR](https://github.com/sourcegraph/docs/pull/477)
- We don't have a query input field, but instead the query is defined in a batch spec YAML. It didn't feel right to edit the YAML and append "patternType: " on save, which is what we do for Code Monitors and Insights.
- I misuse the pattern type query parameter to effectively override the version. Once we introduce "V4" we should come back here and clean up. I left a TODO in the code.
Test plan:
- New and updated unit tests
- Manual testing:
  - new batch changes use `version: 2` by default
  - using an unsupported version returns an error
  - I ran various "on:" queries to verify that version 2 uses keyword search and version 1 uses standard search.
These commits do a few things:
---
46b1303e62ea7e01ba6a441cc55bbe4c166ef5ce corrects a few minor mistakes
with the new site config which I introduced in #63654 - namely fixing
`examples` entries and nullability in a few cases. Nothing controversial
here, just bug fixes.
---
750b61e7dfa661338c9b40042087aed8e795f900 makes it so that the
`/.api/client-config` endpoint returns `"modelsAPIEnabled": true,` if
`"modelConfiguration"` is set in the site config. For context,
`"modelConfiguration"` is a new site config field, which is not used
anywhere before this PR, and has this description:
> BETA FEATURE, only enable if you know what you are doing. If set, Cody
will use the new model configuration system and ignore the old
'completions' site configuration entirely.
I will send a change to the client logic next so that it uses this
`modelsAPIEnabled` field instead of the client-side feature flag
`dev.useServerDefinedModels`.
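The gate itself is tiny; conceptually it is just a presence check on the
new field (illustrative types, not the actual client-config code):
```go
package main

// modelConfiguration's fields are elided here; only its presence matters.
type modelConfiguration struct{}

type siteConfiguration struct {
	ModelConfiguration *modelConfiguration `json:"modelConfiguration"`
}

// clientConfig sketches how /.api/client-config can derive modelsAPIEnabled
// purely from whether the new site config field is set.
func clientConfig(c siteConfiguration) map[string]any {
	return map[string]any{
		"modelsAPIEnabled": c.ModelConfiguration != nil,
	}
}
```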
---
Finally, f52fba342dd2e62a606b885802f7f6bc37f4f4ac and
bde67d57c39f4566dc9287f8793cb5ffd25955b3 make a few site config changes
that @chrsmith and I discussed to enable Self-hosted models support.
Specifically, it makes it possible to specify the following
configuration in the site config:
```
// Setting this field means we are opting into the new Cody model configuration system which is in beta.
"modelConfiguration": {
  // Disable use of Sourcegraph's servers for model discovery
  "sourcegraph": null,
  // Configure the OpenAI-compatible API endpoints that Cody should use to provide
  // mistral and bigcode (starcoder) models.
  "providerOverrides": [
    {
      "displayName": "Mistral",
      "id": "mistral",
      "serverSideConfig": {
        "type": "openaicompatible",
        "endpoint": "...",
        "accessToken": "..."
      }
    },
    {
      "displayName": "Bigcode",
      "id": "bigcode",
      "serverSideConfig": {
        "type": "openaicompatible",
        "endpoint": "...",
        "accessToken": "..."
      }
    }
  ],
  // Configure which exact mistral and starcoder models we want available
  "modelOverridesRecommendedSettings": [
    "bigcode::v1::starcoder2-7b",
    "mistral::v1::mixtral-8x7b-instruct"
  ],
  // Configure which models Cody will use by default
  "defaultModels": {
    "chat": "mistral::v1::mixtral-8x7b-instruct",
    "fastChat": "mistral::v1::mixtral-8x7b-instruct",
    "codeCompletion": "bigcode::v1::starcoder2-7b"
  }
}
```
Currently this site config is not actually used, so Sourcegraph should
not be configured like this today; that will come in a future PR by me.
@chrsmith, one divergence from what we discussed: you and I had planned
to support this:
```
"modelOverrides": [
{
"bigcode::v1::starcoder2-7b"": {
"useRecommendSettings": true,
},
"mistral::v1::mixtral-8x22b-instruct": {
"useRecommendSettings": true,
},
}
],
```
However, being able to specify `"useRecommendSettings": true` inside of
a `ModelOverride` in the site configuration means that all other
`ModelOverride` fields (the ones we are accepting as recommended
settings) must be optional, which seems quite bad and opens up a number
of misconfiguration possibilities.
Instead, I opted to introduce a new top-level field for model overrides
_with recommended settings_, so the above becomes this instead:
```
"modelOverridesRecommendedSettings": [
"bigcode::v1::starcoder2-7b",
"mistral::v1::mixtral-8x7b-instruct"
],
```
This has the added benefit of making it impossible to set both
`"useRecommendSettings": true` and other fields.
I will make it a site config error (preventing admins from saving the
configuration) to specify the same model in both `modelOverrides` and
`modelOverridesRecommendedSettings` in a future PR.
---
## Test plan
Doesn't affect users yet. Careful review.
## Changelog
N/A
---------
Signed-off-by: Stephen Gutekanst <stephen@sourcegraph.com>
## User-facing impacts
This PR adds a new top-level field to the site configuration,
`"modelConfiguration"`, which is intended to replace the old
`"completions"` field. For now, this is 100% opt-in (a beta feature),
and customers should only use this new field if they have talked with us
to confirm it should work in their case.
Today, this configuration is unused, but in future PRs it will actually
be used (planned for the 5.5.0 release).
Additionally, `"modelConfiguration"` acts as a feature flag. If it is
set and not `null`, then Cody will enable this whole new model
configuration system end-to-end, which involves many new components:
* Clients will make use of a new
`/.api/modelconfig/supported-models.json` endpoint to query which models
the server has available.
* Cody will respect this new site configuration, discovering new models
from Sourcegraph's servers without an upgrade by default, etc.
* Clients will enable the "select an LLM model" dropdown menu for
enterprise customers.
* The old `"completions"` configuration, if present, will be ignored if
`"modelConfiguration"` is set.
## Implementation notes
This schema mirrors [the new model configuration
schema](https://github.com/sourcegraph/sourcegraph/tree/main/internal/modelconfig/types)
that Chris and I have been working on tirelessly to enable a myriad
of use-cases in the near future, including enabling customers to get
support for new models without upgrading Sourcegraph, enabling multiple
models / a dropdown model selector in enterprise, improved self-hosted
model support, and more.
The translation is pretty much 1:1, with a few notable aspects:
* I broke out
[`GenericProviderConfig`](https://github.com/sourcegraph/sourcegraph/blob/main/internal/modelconfig/types/configuration.go#L49-L73)
into distinct types for each provider.
* I represent `ClientSideProviderConfig` and `ClientSideModelConfig` as
_objects_ (specifying multiple fields).
* I represent `ServerSideProviderConfig` and `ServerSideModelConfig` as
_tagged unions_ / _discriminated unions_, i.e. where you must write a
`{"type": "foo"}` field as part of the object (see the sketch below).
My next PR will be to handle conversion from the site config types to
the `modelconfig/types`.
## Test plan
No impact to product behavior yet, nothing to test.
## Changelog
Changelog entry will come later with proper docs link, when we are ready
for customers to use this.
---------
Signed-off-by: Stephen Gutekanst <stephen@sourcegraph.com>
Co-authored-by: Chris Smith <chrsmith@users.noreply.github.com>
This PR removes the keyword search toggle and popover as part of making
the feature GA, but keeps the "call to action" on the search landing
page.
Main changes:
* Remove toggle on search results page
* Stop checking `experimentalFeatures.keywordSearch`. (Instead, users should
set `search.defaultPatternType: standard`)
* Remove `LegacyToggles` and all references. This duplicated `Toggles` and is
no longer needed since we unified the implementations.
Closes SPLF-111
This new feature is WIP, so this puts it behind an off-by-default feature flag.
As of now (Jul 2 2024), the feature flag is enabled on S2, Sourcegraph.com
and in dev environments.
Makes partial progress towards
https://linear.app/sourcegraph/issue/GRAPH-721
After this, I'll make necessary changes to the various configs to enable
this feature flag for dev, S2 and Sourcegraph.com. After that, I'll change
the default to be `false`.
[Linear Issue
](https://linear.app/sourcegraph/issue/CODY-2586/fix-completions-models-api-for-azure-to-use-the-right-model-with-the)
The purpose of this PR is to provide a backwards-compatible solution
such that the completions logic in our codebase for Azure supports both
the completions API (which is old) and the chat/completions API (which
is new). This way we can use models from both of them with autocomplete.
Note: we can't figure out which model we are using, because Azure
exposes the deployment name instead of the model name, and because of
that we can't statically decide which API to use for which model.
Instead, we try both APIs; the API that works is cached for that model,
and we then use the cached choice for subsequent completion calls. This
way we can use either of the APIs without adding latency to completions.
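In pseudocode, the try-then-cache strategy looks roughly like this
(hypothetical names; the stubbed calls stand in for the real Azure
requests):
```go
package main

import (
	"context"
	"errors"
	"sync"
)

type apiKind int

const (
	apiUnknown         apiKind = iota
	apiChatCompletions         // newer /chat/completions
	apiCompletions             // legacy /completions
)

var (
	mu       sync.Mutex
	apiCache = map[string]apiKind{} // Azure deployment name -> API that worked
)

// complete uses the cached API for a deployment if known; otherwise it tries
// the chat API first and falls back to the legacy API, caching whichever works.
func complete(ctx context.Context, deployment, prompt string) (string, error) {
	mu.Lock()
	kind := apiCache[deployment]
	mu.Unlock()

	if kind == apiUnknown || kind == apiChatCompletions {
		if out, err := callChatCompletions(ctx, deployment, prompt); err == nil {
			remember(deployment, apiChatCompletions)
			return out, nil
		} else if kind == apiChatCompletions {
			return "", err // the known-good API failed; report, don't flip-flop
		}
	}
	out, err := callCompletions(ctx, deployment, prompt)
	if err == nil {
		remember(deployment, apiCompletions)
	}
	return out, err
}

func remember(deployment string, k apiKind) {
	mu.Lock()
	apiCache[deployment] = k
	mu.Unlock()
}

// Stubs standing in for the real Azure OpenAI calls.
func callChatCompletions(ctx context.Context, deployment, prompt string) (string, error) {
	return "", errors.New("not implemented")
}

func callCompletions(ctx context.Context, deployment, prompt string) (string, error) {
	return "", errors.New("not implemented")
}
```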
## Test plan
I used the Azure keys to try out the different deployment models that we
have, with both the old and the new API.
Old API -> Completions (gpt-3.5-turbo-instruct, gpt-3.5-turbo(301),
gpt-3.5-turbo(613))
New API -> Chat Completions (gpt-3.5-turbo(301), gpt-4o,
gpt-3.5-turbo(613), gpt-3.5-turbo-16k)
Note: both sets of models work seamlessly with this PR.
<!-- REQUIRED; info at
https://docs-legacy.sourcegraph.com/dev/background-information/testing_principles
-->
## Changelog
<!-- OPTIONAL; info at
https://www.notion.so/sourcegraph/Writing-a-changelog-entry-dd997f411d524caabf0d8d38a24a878c
-->
Description:
This PR introduces support for counting tokens within the Azure code and
updating these counts in Redis. The token counting logic is embedded
directly in the Azure code rather than using a standardized point for
all token counting logic.
Reasoning:
• Azure does not currently support obtaining token usage from their
streaming endpoint, unlike OpenAI.
• To enable immediate functionality, the token counting logic is placed
within the Azure code itself.
• The implementation supports GPT-4o.
Future Considerations:
• When Azure eventually adds support for token usage from the streaming
endpoint, we will migrate to using Azure’s built-in capabilities.
• This will ensure full utilization of Azure OpenAI features as they
achieve parity with OpenAI.
Changes:
• Added token counting logic to the Azure code (sketched below).
• Updated Redis with the token counts.
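A minimal sketch of that flow (the Redis key names and `countTokens`
heuristic are stand-ins, not the actual implementation):
```go
package main

import (
	"context"

	"github.com/redis/go-redis/v9"
)

// recordUsage counts tokens locally (Azure's streaming endpoint doesn't report
// usage) and accumulates the counts in Redis.
func recordUsage(ctx context.Context, rdb *redis.Client, model, prompt, completion string) error {
	if err := rdb.IncrBy(ctx, "tokens:input:"+model, int64(countTokens(prompt))).Err(); err != nil {
		return err
	}
	return rdb.IncrBy(ctx, "tokens:output:"+model, int64(countTokens(completion))).Err()
}

// countTokens is a placeholder; the real code needs a tokenizer that matches
// the model (e.g. a GPT-4o-compatible BPE tokenizer).
func countTokens(text string) int {
	return len(text) / 4 // rough heuristic: ~4 characters per token
}
```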
Testing:
• Verified the implementation works with GPT-4o.
Conclusion:
This is a temporary solution to enable token counting in Azure. We will
adapt our approach as Azure enhances its feature set to include token
usage from their streaming endpoint.
## Test plan
Tested locally with debugger
<!-- All pull requests REQUIRE a test plan:
https://docs-legacy.sourcegraph.com/dev/background-information/testing_principles
-->
## Changelog
<!--
1. Ensure your pull request title is formatted as: $type($domain): $what
2. Add bullet list items for each additional detail you want to cover
(see example below)
3. You can edit this after the pull request was merged, as long as
release shipping it hasn't been promoted to the public.
4. For more information, please see this how-to
https://www.notion.so/sourcegraph/Writing-a-changelog-entry-dd997f411d524caabf0d8d38a24a878c?
Audience: TS/CSE > Customers > Teammates (in that order).
Cheat sheet: $type = chore|fix|feat $domain:
source|search|ci|release|plg|cody|local|...
-->
<!--
Example:
Title: fix(search): parse quotes with the appropriate context
Changelog section:
## Changelog
- When a quote is used with regexp pattern type, then ...
- Refactored underlying code.
-->
This integrates the new occurrences API into the Svelte webapp. This
fixes a number of issues where the syntax highlighting data is not an
accurate way to determine hoverable tokens. It is currently behind the
setting `experimentalFeatures.enablePreciseOccurrences`
"lucky" was an experimental pattern type we added about 2 years ago.
Judging from the git history and the current code, it was at some point
replaced by "smart search" and "search mode", which we also plan to
remove soon.
See https://github.com/sourcegraph/sourcegraph/pull/43140 for more
context
Test plan:
CI
Historically, sourcegraph.com has been the only instance. It was
connected to GitHub.com and GitLab.com only.
Configuration should be as simple as possible, and we wanted everyone to
try it on any repo. So public repos were added on demand when browsed
from these code hosts.
Since then, dotcom has stopped being the only instance, and this
on-demand behavior is a special case that only exists for sourcegraph.com.
This causes a bunch of additional complexity and various extra code
paths that we don't test well enough today.
We want to make dotcom simpler to understand, so we've made the decision
to disable that feature, and instead we will maintain a list of
repositories that we have on the instance.
We already disallowed many repos half a year ago by heavily restricting
the size of repos with few stars.
This is basically just a continuation of that.
In the diff, you'll mostly find deletions. This PR does not do much
other than removing the code paths that were only enabled in dotcom mode
in the repo syncer, and then removes code that became unused as a result
of that.
## Test plan
Ran a dotcom-mode instance locally; it did not behave differently from a
regular instance with respect to repo cloning.
We will need to verify during the rollout that we're not suddenly
hitting code paths that don't scale to the dotcom size.
## Changelog
Dotcom no longer clones repos on demand.
Part of https://github.com/sourcegraph/sourcegraph/issues/62448
Linear issue
[SRCH-573](https://linear.app/sourcegraph/issue/SRCH-573/integrate-cody-web-package-into-the-sourcegraph-ui)
This is a highly experimental usage of the new `cody-web-experimental`
package (not currently merged, but published on NPM).
## How to run it
- (Optional) If you previously linked any local packages, make sure they
no longer exist in your node_modules: `rm -rf node_modules` in the
root, then `pnpm install`
- Run standard `sg start web-standalone`
- Turn on `newCodyWeb: true` in your `experimentalFeatures`
## How to run it locally with prototype PR in Cody repository
- Open Cody repository on the `vk/integrate-cody-web-chat-2` branch
- At the root of the repo, run `pnpm install` to make sure you're up to
date with all of the dependencies.
- Go to the web package (`cd web`)
- Build it with `pnpm build`
- Create a global link with `pnpm link --global` (Ignore the warning
message about no binary)
- Open sourcegraph/sourcegraph repository on this PR branch
- Make sure you are in the root of the repo.
- Run `pnpm link --global cody-web-experimental`
- Run `sg start web-standalone` to bundle the web app and launch an
instance that uses S2 for the backend. You'll need to create a login on
S2 that is not federated by GitHub.
- Turn on `newCodyWeb: true` in your `experimentalFeatures`
- Have fun experimenting!
## Test plan
- Check that the old version of Cody has no regressions
Adds `dotcom.codyProConfig.useEmbeddedUI` site config param.
This param defines whether the Cody Pro subscription and team management
UI should be served from the connected instance running in the dotcom
mode. The default value is `false`. This change allows us to enable the
SSC proxy on the instance without enabling the new embedded Cody Pro UI.
Previously, whether the embedded Cody Pro UI was enabled was determined
by `dotcom.codyProConfig` being set, which prevented us from enabling
the SSC proxy without enabling the embedded UI:
> Whether the SSC proxy is enabled is [defined based on
`dotcom.codyProConfig`](41fb56d619/cmd/frontend/internal/ssc/ssc_proxy.go (L227-L231))
being set in the site config. This value is also partially
[propagated](41fb56d619/cmd/frontend/internal/app/jscontext/jscontext.go (L481))
to the frontend via jscontext. And the frontend [uses this
value](41fb56d619/client/web/src/cody/util.ts (L8-L18))
to define whether to use new embedded UI or not.
For more details see [this Slack
thread](https://sourcegraph.slack.com/archives/C05PC7AKFQV/p1719010292837099?thread_ts=1719000927.962429&cid=C05PC7AKFQV).
<!-- 💡 To write a useful PR description, make sure that your description
covers:
- WHAT this PR is changing:
- How was it PREVIOUSLY.
- How it will be from NOW on.
- WHY this PR is needed.
- CONTEXT, i.e. to which initiative, project or RFC it belongs.
The structure of the description doesn't matter as much as covering
these points, so use
your best judgement based on your context.
Learn how to write good pull request description:
https://www.notion.so/sourcegraph/Write-a-good-pull-request-description-610a7fd3e613496eb76f450db5a49b6e?pvs=4
-->
## Test plan
- CI
- Tested manually:
  - Ran a Sourcegraph instance locally in dotcom mode
  - Set `dotcom.codyProConfig` in the site config
  - Checked that `context.frontendCodyProConfig` returns the [correct
values from the site
config](184da4ce4a/cmd/frontend/internal/app/jscontext/jscontext.go (L711-L715))
<!-- All pull requests REQUIRE a test plan:
https://docs-legacy.sourcegraph.com/dev/background-information/testing_principles
-->
## Changelog
<!--
1. Ensure your pull request title is formatted as: $type($domain): $what
2. Add bullet list items for each additional detail you want to cover
(see example below)
3. You can edit this after the pull request was merged, as long as
release shipping it hasn't been promoted to the public.
4. For more information, please see this how-to
https://www.notion.so/sourcegraph/Writing-a-changelog-entry-dd997f411d524caabf0d8d38a24a878c?
Audience: TS/CSE > Customers > Teammates (in that order).
Cheat sheet: $type = chore|fix|feat $domain:
source|search|ci|release|plg|cody|local|...
-->
<!--
Example:
Title: fix(search): parse quotes with the appropriate context
Changelog section:
## Changelog
- When a quote is used with regexp pattern type, then ...
- Refactored underlying code.
-->
The search console page is broken, is not used or maintained, and is
only referenced by a series of blog posts years ago. We have product
support to remove it.
It seems many of our doc links for code hosts are broken in production
due to a URL change from `external_services` to `code_hosts`. I did a
find-and-replace to update all the ones I could find.
Closes https://github.com/sourcegraph/cody-issues/issues/211 &
https://github.com/sourcegraph/cody-issues/issues/412
Unblocks https://github.com/sourcegraph/cody/pull/4360
* Add support for Google Gemini AI models as chat completions provider
* Add new `google` package to handle Google Generative AI client
* Update `client.go` and `codygateway.go` to handle the new Google
provider
* Set default models for chat, fast chat, and completions when Google is
the configured provider
* Add gemini-pro to the allowed list
<!-- 💡 To write a useful PR description, make sure that your description
covers:
- WHAT this PR is changing:
- How was it PREVIOUSLY.
- How it will be from NOW on.
- WHY this PR is needed.
- CONTEXT, i.e. to which initiative, project or RFC it belongs.
The structure of the description doesn't matter as much as covering
these points, so use
your best judgement based on your context.
Learn how to write good pull request description:
https://www.notion.so/sourcegraph/Write-a-good-pull-request-description-610a7fd3e613496eb76f450db5a49b6e?pvs=4
-->
## Test plan
<!-- All pull requests REQUIRE a test plan:
https://docs-legacy.sourcegraph.com/dev/background-information/testing_principles
-->
For Enterprise instances using Google as the provider:
1. In your local Sourcegraph instance's site config, add the following:
```
"accessToken": "REDACTED",
"chatModel": "gemini-1.5-pro-latest",
"provider": "google",
```
Note: You can get the access token for the Gemini API in 1Password.
2. After saving the site config with the above change, run the following
curl command:
```
curl 'https://sourcegraph.test:3443/.api/completions/stream' -i \
-X POST \
-H 'authorization: token $LOCAL_INSTANCE_TOKEN' \
--data-raw '{"messages":[{"speaker":"human","text":"Who are you?"}],"maxTokensToSample":30,"temperature":0,"stopSequences":[],"timeoutMs":5000,"stream":true,"model":"gemini-1.5-pro-latest"}'
```
3. Expected Output:
```
❯ curl 'https://sourcegraph.test:3443/.api/completions/stream' -i \
-X POST \
-H 'authorization: token <REDACTED>' \
--data-raw '{"messages":[{"speaker":"human","text":"Who are you?"}],"maxTokensToSample":30,"temperature":0,"stopSequences":[],"timeoutMs":5000,"stream":true,"model":"gemini-1.5-pro-latest"}'
HTTP/2 200
access-control-allow-credentials: true
access-control-allow-origin:
alt-svc: h3=":3443"; ma=2592000
cache-control: no-cache
content-type: text/event-stream
date: Tue, 04 Jun 2024 05:45:33 GMT
server: Caddy
server: Caddy
vary: Accept-Encoding, Authorization, Cookie, Authorization, X-Requested-With, Cookie
x-accel-buffering: no
x-content-type-options: nosniff
x-frame-options: DENY
x-powered-by: Express
x-trace: d4b1f02a3e2882a3d52331335d217b03
x-trace-span: 728ec33860d3b5e6
x-trace-url: https://sourcegraph.test:3443/-/debug/jaeger/trace/d4b1f02a3e2882a3d52331335d217b03
x-xss-protection: 1; mode=block
event: completion
data: {"completion":"I","stopReason":"STOP"}
event: completion
data: {"completion":"I am a large language model, trained by Google. \n\nThink of me as","stopReason":"STOP"}
event: completion
data: {"completion":"I am a large language model, trained by Google. \n\nThink of me as a computer program that can understand and generate human-like text.","stopReason":"MAX_TOKENS"}
event: done
data: {}
```
Verified locally:

#### Before
Cody Gateway returns `no client known for upstream provider google`
```sh
curl -X 'POST' -d '{"messages":[{"speaker":"human","text":"Who are you?"}],"maxTokensToSample":30,"temperature":0,"stopSequences":[],"timeoutMs":5000,"stream":true,"model":"google/gemini-1.5-pro-latest"}' -H 'Accept: application/json' -H 'Authorization: token $YOUR_DOTCOM_TOKEN' -H 'Content-Type: application/json' 'https://sourcegraph.com/.api/completions/stream'
event: error
data: {"error":"no client known for upstream provider google"}
event: done
data: {
```
## Changelog
<!--
1. Ensure your pull request title is formatted as: $type($domain): $what
2. Add bullet list items for each additional detail you want to cover
(see example below)
5. You can edit this after the pull request was merged, as long as
release shipping it hasn't been promoted to the public.
6. For more information, please see this how-to
https://www.notion.so/sourcegraph/Writing-a-changelog-entry-dd997f411d524caabf0d8d38a24a878c?
Audience: TS/CSE > Customers > Teammates (in that order).
Cheat sheet: $type = chore|fix|feat $domain:
source|search|ci|release|plg|cody|local|...
-->
<!--
Example:
Title: fix(search): parse quotes with the appropriate context
Changelog section:
## Changelog
- When a quote is used with regexp pattern type, then ...
- Refactored underlying code.
-->
Added support for Google as an LLM provider for Cody, with the following
models available through Cody Gateway: Gemini Pro (`gemini-pro-latest`),
Gemini 1.5 Flash (`gemini-1.5-flash-latest`), and Gemini 1.5 Pro
(`gemini-1.5-pro-latest`).
This has historically been set to 1 hour.
We've seen several reports of users running into the limit for clones of very large repositories, but we have seen no complaints of processes hanging for very long and clogging any queues.
So it feels sensible to me to increase the default for this value to 2h.
We might come back here later and decide that we don't really need a deadline here at all and instead hard-code a day or so to prevent infinite clogging, but let's see how far 2x gets us for now.
Test plan:
CI still passes.
* Adding the User param to the site config so that it can be supported by Azure as an extra param
* Rename smartContext to smartContextWindow
* Update CHANGELOG.md
Co-authored-by: Kalan <51868853+kalanchan@users.noreply.github.com>
---------
Co-authored-by: Kalan <51868853+kalanchan@users.noreply.github.com>
Previously, too many fields were required, which meant you needed
to specify unnecessary fields when trying to modify only a single
field, such as mapping specific extensions to specific languages.
Fixes https://linear.app/sourcegraph/issue/GRAPH-612
* feat(completions): add smart context site config
- Add `SmartContext` field to `CompletionsConfig` struct
- Implement `SmartContext()` method in `codyLLMConfigurationResolver`
- Set default `SmartContext` value to "enabled" if not provided
- Update schema and documentation to describe `SmartContext` feature
* Update unit test
* Add Changelog entry
* Add config item, get it to the front end
* Use config on the front end
* Send team=1 if the team button is clicked
* Unrelated: Event logging cleanup
- On the frontend:
  - Added a new field named `search.displayLimit` to the User settings
  - Started using the `search.displayLimit` value while performing stream search
- On the backend:
  - No changes
---------
Co-authored-by: Stefan Hengl <stefan@sourcegraph.com>
This adds a couple of configuration options which will allow us to tune the cost of code monitors in the event that we are seeing issues. It does not change the default values.
This removes qdrant from this codebase entirely.
All the docker images, dependencies, (dead) usage in code.
My understanding is that we don't use this feature and never properly rolled it out.
Test plan:
CI passes and code review from owners.
Noticed this test failing once locally and worked out the source of the
flakiness. Could reproduce with the below test plan.
Test Plan: "go test -race -count=100 ./schema" passes