Problem/Motivation

The maxEmbeddingsInput() method currently returns a hardcoded value of 1024 tokens, with only a @todo comment acknowledging the limitation:

  public function maxEmbeddingsInput($model_id = ''): int {
    // @todo this is playing safe. Ideally, we should provide real number per model.
    return 1024;
  }

This has two problems:

1. Incorrect value: The mistral-embed model supports 8192 tokens, not 1024, so the hardcoded value unnecessarily limits the amount of text that can be embedded.
2. Not using the $model_id parameter: The method receives a model ID but ignores it. The Mistral API returns max_context_length for each model via the /v1/models endpoint, which should be used to return accurate limits per model.

Steps to reproduce

0. Have the ai_provider_mistral module installed and set up
1. Call $provider->maxEmbeddingsInput('mistral-embed')
2. Observe it returns 1024 regardless of model
3. Check the Mistral API: mistral-embed actually supports 8192 tokens

Proposed resolution

Update the method to dynamically fetch the model's max_context_length from the Mistral API, falling back to known defaults when the model is not found.
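The core of the proposed resolution can be sketched as a lookup over the decoded /v1/models response. This is a minimal sketch, not the actual patch: the function name and the exact response shape (`data` list of models with `id` and `max_context_length` keys, following the OpenAI-compatible layout) are assumptions.

```php
/**
 * Resolve the max embeddings input (in tokens) for a Mistral model from a
 * decoded /v1/models response, falling back to a safe default when the
 * model is missing. Illustrative sketch only; names are hypothetical.
 */
function mistral_max_input(array $models, string $model_id, int $fallback = 1024): int {
  foreach ($models['data'] ?? [] as $model) {
    // Match the requested model and use its advertised context length.
    if (($model['id'] ?? '') === $model_id && isset($model['max_context_length'])) {
      return (int) $model['max_context_length'];
    }
  }
  // Unknown model: keep the previous conservative default.
  return $fallback;
}
```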

Remaining tasks

- Implement dynamic fetching from API
- Add fallback for known models


Comments

petar_basic created an issue. See original summary.

petar_basic’s picture

Assigned: petar_basic » Unassigned
Status: Needs work » Needs review

Implemented maxEmbeddingsInput() to fetch the actual max_context_length from Mistral's /v1/models API endpoint instead of returning a hardcoded value of 1024.

- Defaults to mistral-embed model if no model_id specified
- Caches the result for 24 hours (to be consistent with getConfiguredModels())
- Falls back to 1024 if the model is not found in the API response
- Added kernel tests for both the API fetch and fallback scenarios
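The described behavior (default model, 24-hour cache, 1024 fallback) can be illustrated with a simplified, framework-free sketch. The real implementation uses Drupal's cache API and the provider's HTTP client; here a static array and an injected fetcher callback stand in for both, and all names are illustrative rather than the actual MR code.

```php
/**
 * Framework-free sketch of the patched method. A static array stands in
 * for Drupal's cache backend; $fetch_models stands in for the HTTP call
 * to Mistral's /v1/models endpoint.
 */
function max_embeddings_input(string $model_id, callable $fetch_models): int {
  static $cache = [];
  // Default to mistral-embed when no model ID is specified.
  $model_id = $model_id !== '' ? $model_id : 'mistral-embed';
  $now = time();
  // Serve a cached value for up to 24 hours, consistent with getConfiguredModels().
  if (isset($cache[$model_id]) && $cache[$model_id]['expires'] > $now) {
    return $cache[$model_id]['value'];
  }
  // Fall back to 1024 if the model is absent from the API response.
  $value = 1024;
  foreach ($fetch_models()['data'] ?? [] as $model) {
    if (($model['id'] ?? '') === $model_id && isset($model['max_context_length'])) {
      $value = (int) $model['max_context_length'];
      break;
    }
  }
  $cache[$model_id] = ['value' => $value, 'expires' => $now + 86400];
  return $value;
}
```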

Note on interface documentation:

The EmbeddingsInterface::maxEmbeddingsInput() docblock states it returns "Max input string length in bytes", but the actual usage in the ai_search module's EmbeddingStrategyPluginBase passes this value to TextChunker, which uses a tokenizer (getEncodedChunks). This means the value is interpreted as tokens, not bytes. The Mistral API returns max_context_length in tokens. The interface documentation should probably be updated to reflect this.
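If the interface docblock is updated as suggested, the corrected wording might look like the following. This is purely illustrative proposed text, not the actual interface source:

```php
/**
 * Maximum input size for embeddings calls.
 *
 * @param string $model_id
 *   The model ID, or an empty string for the provider default.
 *
 * @return int
 *   Max input length in tokens (not bytes); consumers such as the
 *   ai_search TextChunker treat this value as a token count.
 */
public function maxEmbeddingsInput($model_id = ''): int;
```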

petar_basic’s picture

Issue summary: View changes
petar_basic’s picture

Issue summary: View changes

To test, this can be run:

drush php-eval "
  \$provider = \Drupal::service('ai.provider')->createInstance('mistral');
  print 'maxEmbeddingsInput for mistral-embed: ' . \$provider->maxEmbeddingsInput('mistral-embed') . PHP_EOL;
"

fago made their first commit to this issue’s fork.

fago’s picture

Status: Needs review » Reviewed & tested by the community

good find, solid fix and tests, ready!

> (getEncodedChunks). This means the value is interpreted as tokens, not bytes. The Mistral API returns max_context_length in tokens. The interface documentation should probably be updated to reflect this.

let's open an issue and file an MR to fix that then?

  • fago committed aef865fe on 1.1.x authored by petar_basic
    fix: #3570539 maxEmbeddingsInput() should fetch token limits from API...
fago’s picture

Status: Reviewed & tested by the community » Fixed

Merged, so setting this one to fixed.


Status: Fixed » Closed (fixed)

Automatically closed - issue fixed for 2 weeks with no activity.