mirror of https://github.com/sourcegraph/sourcegraph.git synced 2026-02-06 17:31:43 +00:00

Ara 141d2e0cc4 Add Support for Counting Tokens for Azure Code and Update in Redis (#63100 )	2024-06-28 12:37:53 +00:00

Description: This PR introduces support for counting tokens within the Azure code and updating these counts in Redis. The token counting logic is embedded directly in the Azure code rather than using a standardized point for all token counting logic.

Reasoning:
• Azure does not currently support obtaining token usage from their streaming endpoint, unlike OpenAI.
• To enable immediate functionality, the token counting logic is placed within the Azure code itself.
• The implementation supports GPT-4o.

Future Considerations:
• When Azure eventually adds support for token usage from the streaming endpoint, we will migrate to using Azure’s built-in capabilities.
• This will ensure full utilization of Azure OpenAI features as they achieve parity with OpenAI.

Changes:
• Added token counting logic to the Azure code.
• Updated Redis with the token counts.

Testing:
• Verified the implementation works with GPT-4o.

Conclusion: This is a temporary solution to enable token counting in Azure. We will adapt our approach as Azure enhances its feature set to include token usage from their streaming endpoint.

## Test plan
Tested locally with debugger
..
branded	fix(search): Token decoration in keyword-enabled query input (#63543 )	2024-06-28 14:30:16 +02:00
browser	chore(ci): remove Percy visual tests (#63515 )	2024-06-27 16:20:06 +02:00
build-config	[React]: Add initial usage of the new web worker-based cody web chat (#62792 )	2024-06-26 12:13:29 -03:00
client-api	v2t: add v2 telemetry to the client/shared folder (#62586 )	2024-06-03 16:34:28 -07:00
codeintellify	Migrate deprecated rxjs functions/methods (#61222 )	2024-04-08 11:23:34 +02:00
cody-context-filters-test-dataset	Create a shared Cody Ignore dataset (#61968 )	2024-05-09 13:18:35 +00:00
cody-shared	Rename smartContext to smartContextWindow (#62948 )	2024-05-28 10:00:34 -07:00
cody-ui	Cody web: Bring back old packages from git history (#61376 )	2024-04-08 14:21:41 +02:00
common	chore: Bump go-enry and Zoekt to handle new languages (#63281 )	2024-06-20 22:19:39 +08:00
eslint-plugin-wildcard	chore: upgrade to Aspect CLI 5.8.5 (#57961 )	2023-10-30 17:01:58 +02:00
extension-api	Docs: update links to point to new site (#60381 )	2024-02-13 00:23:47 +00:00
extension-api-types	use @typescript-eslint projectService for faster eslint (#57851 )	2023-10-24 01:40:40 +00:00
http-client	reapply "switch from jest to vitest for faster, simpler tests (#57886 )" (#58145 )	2023-11-07 12:00:18 +02:00
jetbrains	looser eslint rules (#63511 )	2024-06-27 08:42:51 +00:00
observability-client	reapply "switch from jest to vitest for faster, simpler tests (#57886 )" (#58145 )	2023-11-07 12:00:18 +02:00
observability-server	reapply "switch from jest to vitest for faster, simpler tests (#57886 )" (#58145 )	2023-11-07 12:00:18 +02:00
shared	Chore: refactoring occurrence indexing (#63473 )	2024-06-28 02:46:20 +00:00
storybook	fix: update links for dev docs (#62758 )	2024-05-17 13:47:34 +02:00
template-parser	reapply "switch from jest to vitest for faster, simpler tests (#57886 )" (#58145 )	2023-11-07 12:00:18 +02:00
testing	chore(bazel): enable rules_esbuild sandbox with object-inspect workaround (#61969 )	2024-06-05 15:34:29 +01:00
vscode	fix(search): VSCode Search extension: bring back matched lines in search results. (#63524 )	2024-06-27 13:24:51 -06:00
web	fix(search): Token decoration in keyword-enabled query input (#63543 )	2024-06-28 14:30:16 +02:00
web-sveltekit	Add Support for Counting Tokens for Azure Code and Update in Redis (#63100 )	2024-06-28 12:37:53 +00:00
wildcard	looser eslint rules (#63511 )	2024-06-27 08:42:51 +00:00
BUILD.bazel	Added ts_projects for storybook files in client/* (#59400 )	2024-01-09 10:37:53 -08:00
README.md	use esbuild for client/web builds (#57365 )	2023-10-23 10:59:06 -07:00

README.md

Frontend packages

List

web: The web application deployed to http://sourcegraph.com/
browser: The Sourcegraph browser extension adds tooltips to code on different code hosts.
vscode: The Sourcegraph VS Code extension.
extension-api: The Sourcegraph extension API types for the Sourcegraph extensions. Published as sourcegraph.
extension-api-types: The Sourcegraph extension API types for client applications that embed Sourcegraph extensions and need to communicate with them. Published as @sourcegraph/extension-api-types.
sandboxes: Minimum viable product (MVP) demos for the Sourcegraph web application.
shared: Contains common TypeScript/React/SCSS client code shared between the browser extension and the web app. Everything in this package is code-host agnostic.
branded: Contains React components and implements the visual design language we use across our web app and elsewhere, e.g. in the options menu of the browser extension. Over time, components from the shared and branded packages should be moved into the wildcard package.
wildcard: Package that encapsulates the Storybook configuration and contains our Wildcard design system components. If we're using a component in two or more different areas (e.g. web-app and browser-extension), then it should live in the wildcard package. Otherwise, components are better colocated with the code where they're actually used.
search: Search-related code that may be shared between all clients, both branded (e.g. web, VS Code extension) and unbranded (e.g. browser extension).
storybook: Storybook configuration.

Further migration plan

Fix the circular dependency in the TS project-references graph: the wildcard package should not rely on web, and probably not on shared or branded either. Ideally, it should be an independent, self-contained package.
Decide on package naming and update existing package names. This is especially important for the shared package, because we have multiple shared folders inside other packages. It's hard to tell where a dependency is coming from, and it's not possible to refactor import paths using find-and-replace.
Investigate if we can painlessly switch to npm workspaces.
The content of the shared and branded packages should be moved to wildcard and refactored using the latest FE rules and conventions. Having different packages clearly communicates the migration plan. Developers should first look for components in the wildcard package and then fall back to the legacy packages if wildcard doesn't have a solution to their problem yet.
shared contains utility functions, types, polyfills, etc., which are not part of the Wildcard component library. These modules should be moved into a utils package and other new packages, e.g. api for the GraphQL client and type generators.
Packages should import by package name (e.g. @sourcegraph/wildcard) instead of relative paths (e.g. ../../../../wildcard/src/components/Markdown) to avoid long relative paths and to make the dependency graph between packages clear (TypeScript will warn if packages have circular dependencies). Such isolated packages are easy to refactor, and their functionality is easy to extract into new packages or even new repositories.
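
To illustrate what a circular dependency between packages looks like, here is a minimal TypeScript sketch that searches a package dependency graph for a cycle using depth-first search. The graph edges below are hypothetical and chosen only to mirror the wildcard-to-web dependency described in the plan; they are not read from the real project-reference configuration, and findCycle is an illustrative helper, not part of any Sourcegraph package.

```typescript
// Hypothetical package dependency graph: package name -> packages it imports.
// The edges are illustrative assumptions, not taken from the real tsconfig files.
type DependencyGraph = Record<string, string[]>

// Returns the first dependency cycle found as a path (e.g. ['a', 'b', 'a']),
// or null if the graph has no cycle. Uses depth-first search and tracks
// which packages are on the current path ('visiting') or fully explored ('done').
function findCycle(graph: DependencyGraph): string[] | null {
    const state = new Map<string, 'visiting' | 'done'>()
    const path: string[] = []

    function visit(node: string): string[] | null {
        if (state.get(node) === 'done') {
            return null
        }
        if (state.get(node) === 'visiting') {
            // We reached a package that is already on the current path: a cycle.
            return [...path.slice(path.indexOf(node)), node]
        }
        state.set(node, 'visiting')
        path.push(node)
        for (const dependency of graph[node] ?? []) {
            const cycle = visit(dependency)
            if (cycle) {
                return cycle
            }
        }
        path.pop()
        state.set(node, 'done')
        return null
    }

    for (const node of Object.keys(graph)) {
        const cycle = visit(node)
        if (cycle) {
            return cycle
        }
    }
    return null
}

// Assumed "before" state: wildcard imports from web, and web imports wildcard.
const currentGraph: DependencyGraph = {
    web: ['wildcard', 'shared', 'branded'],
    wildcard: ['web'], // the kind of edge the migration plan wants to remove
    shared: [],
    branded: ['shared'],
}

// Assumed "after" state: wildcard is self-contained.
const targetGraph: DependencyGraph = { ...currentGraph, wildcard: [] }

console.log(findCycle(currentGraph))
console.log(findCycle(targetGraph))
```

With the assumed graphs, the first call reports a path through web and wildcard, and the second reports no cycle, which is the state the migration plan aims for.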