Add Support for Counting Tokens for Azure Code and Update in Redis #63100
base: main
Conversation
"gpt-4" | ||
] | ||
}, | ||
"azureCompletionModel": { |
I hate adding extra parameters to the siteconfig and requestparams. However, I had to add them because Azure does not support naming the model directly: requests specify a deployment name, which can differ from the model name and map to any underlying model. Accurate token counting requires the real model name, and while there are ways to retrieve it from the deployment, they involve complex logic. So, despite being somewhat inelegant, introducing explicit config parameters for the chat model name and the completion model name is the simplest solution given the constraints. A config sketch is below.
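A hypothetical site-config fragment to illustrate the deployment-name/model-name split (only azureChatModel and azureCompletionModel come from this PR; the deployment names are made up for illustration):

```json
{
  "completions": {
    "provider": "azure-openai",
    "chatModel": "my-gpt4-deployment",
    "completionModel": "my-turbo-deployment",
    "azureChatModel": "gpt-4",
    "azureCompletionModel": "gpt-35-turbo"
  }
}
```

With this, token counting can be driven by "gpt-4" / "gpt-35-turbo" while requests still target the named deployments.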
- github.com/pkoukk/tiktoken-go v0.1.6
- github.com/pkoukk/tiktoken-go-loader v0.0.1
+ github.com/pkoukk/tiktoken-go v0.1.7
+ github.com/pkoukk/tiktoken-go-loader v0.0.2-0.20240522064338-c17e8bc0f699
Had to update to support gpt-4o; a quick sketch of what the bump enables is below.
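A minimal standalone sketch, assuming tiktoken-go >= v0.1.7 (which maps gpt-4o to the o200k_base encoding) and the loader's offline BPE loader:

```go
package main

import (
	"fmt"

	tiktoken "github.com/pkoukk/tiktoken-go"
	tiktoken_loader "github.com/pkoukk/tiktoken-go-loader"
)

func main() {
	// Offline BPE loader: avoids fetching encoding files over the network.
	tiktoken.SetBpeLoader(tiktoken_loader.NewOfflineLoader())

	// Resolving "gpt-4o" fails on tiktoken-go < v0.1.7, which predates o200k_base.
	enc, err := tiktoken.EncodingForModel("gpt-4o")
	if err != nil {
		panic(err)
	}
	fmt.Println(len(enc.Encode("hello world", nil, nil))) // token count
}
```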
Will review this tomorrow; apologies for the delay.
logger.Warn("Failed to count tokens with the token manager %w ", log.Error(err)) | ||
logger.Warn("Failed to count output tokens with the token manager %w ", log.Error(err)) | ||
} | ||
if inputTokens != 0 && outputTokens != 0 { |
Check again
Added the validation check in this commit as well.
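For readers following the thread, a rough standalone sketch of the record-only-when-both-counts-are-valid idea; the go-redis client, key scheme, and helper name are hypothetical stand-ins, not the PR's actual token-manager code:

```go
package example

import (
	"context"

	"github.com/redis/go-redis/v9"
)

// recordTokenUsage increments per-model counters, skipping the write when
// either count is zero (mirroring the PR's inputTokens/outputTokens check).
func recordTokenUsage(ctx context.Context, rdb *redis.Client, model string, inputTokens, outputTokens int) error {
	if inputTokens == 0 || outputTokens == 0 {
		return nil // counting failed or was empty; don't skew the stats
	}
	pipe := rdb.Pipeline()
	pipe.IncrBy(ctx, "llm_usage:"+model+":input", int64(inputTokens))
	pipe.IncrBy(ctx, "llm_usage:"+model+":output", int64(outputTokens))
	_, err := pipe.Exec(ctx)
	return err
}
```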
Force-pushed from 561199a to 52b92b9
inputTokens, err := NumTokensFromAzureOpenAiMessages(requestParams.Messages, requestParams.AzureCompletionModel)
if err != nil {
	logger.Warn("Failed to count input tokens with the token manager", log.Error(err))
	return err
Are you positive we want these cases returning token counting errors? I'm unsure.
On the one hand, if token counting is failing, that is bad. But on the other hand, if errors are returned here because token counting is failing... then Cody is entirely, 100%, broken for users.
Seems to me that we're better off logging these errors and simply not recording token counts should there be an issue here. Something like the sketch below.
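A minimal sketch of the suggested log-and-continue behavior; this is the reviewer's proposal, not the PR's current code, and it slots into the function shown in the diff above:

```go
inputTokens, err := NumTokensFromAzureOpenAiMessages(requestParams.Messages, requestParams.AzureCompletionModel)
if err != nil {
	// Log and fall through with a zero count instead of failing the request:
	// a broken token counter should never take down completions entirely.
	logger.Warn("Failed to count input tokens with the token manager", log.Error(err))
	inputTokens = 0
}
```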
if hasAzure && !(hasAzureChatModel && hasAzureCompletionModel) {
	invalid(NewSiteProblem(`when using the azure-openai provider it is mandatory to set both completions.azureChatModel and completions.azureCompletionModel for proper LLM token usage`))
}
Ideally no validation logic would live in this centralized function, as it then becomes too large.
You should add a contributed validator instead, e.g. inside an init function in azureopenai/openai.go. For example, like the sketch below.
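A rough sketch of that suggestion; it assumes the conf package's ContributeValidator hook and NewSiteProblems helper, and the Completions field names are approximations based on what this PR adds:

```go
package azureopenai

import (
	"github.com/sourcegraph/sourcegraph/internal/conf"
	"github.com/sourcegraph/sourcegraph/internal/conf/conftypes"
)

func init() {
	// Register an azure-openai-specific validator instead of extending the
	// central site-config validation function.
	conf.ContributeValidator(func(c conftypes.SiteConfigQuerier) conf.Problems {
		cc := c.SiteConfig().Completions
		if cc == nil || cc.Provider != "azure-openai" {
			return nil
		}
		if cc.AzureChatModel == "" || cc.AzureCompletionModel == "" {
			return conf.NewSiteProblems("when using the azure-openai provider it is mandatory to set both completions.azureChatModel and completions.azureCompletionModel for proper LLM token usage")
		}
		return nil
	})
}
```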
Two minor comments; will be LGTM once those are addressed.
Description:
This PR introduces support for counting tokens in the Azure code path and updating those counts in Redis. The token-counting logic is embedded directly in the Azure code rather than going through a single standardized point for all token-counting logic.
Reasoning:
Future Considerations:
Changes:
Testing:
Conclusion:
This is a temporary solution to enable token counting in Azure. We will adapt our approach once Azure enhances its feature set to report token usage from its streaming endpoint.
Test plan
Tested locally with a debugger
Changelog