@patrickvonplaten (Contributor) commented Dec 8, 2025

Improve script for FP8 conversion.
Tested on all relevant checkpoints (both Devstral 2 & Ministral 3B Reasoning & 14B - FP8)

@patrickvonplaten patrickvonplaten marked this pull request as draft December 8, 2025 19:13
@patrickvonplaten patrickvonplaten marked this pull request as ready for review December 9, 2025 10:38
@patrickvonplaten patrickvonplaten changed the title from "[Mistral Convert] Small addition fp8 convert" to "[Devstral] Make sure FP8 conversion works correctly" Dec 9, 2025
github-actions bot (Contributor) commented Dec 9, 2025

[For maintainers] Suggested jobs to run (before merge)

run-slow: ministral3

@patrickvonplaten patrickvonplaten marked this pull request as draft December 9, 2025 17:11
@patrickvonplaten patrickvonplaten marked this pull request as ready for review December 9, 2025 17:11
@SunMarc (Member) left a comment

Thanks! LGTM, just a nit.


# No vision
if original_vision_config is None:
    new_text_config = Ministral3Config(**new_text_config_kwargs, **get_maybe_quant_config())
@SunMarc (Member) commented Dec 10, 2025
Let's just add the quantization config to new_config, and not potentially to new_text_config. In the config.json that you converted, everything looks good.
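
A minimal sketch of what this suggestion could look like, using plain dicts instead of the real config classes. Everything here is an assumption for illustration: `build_new_config` is a hypothetical helper, this `get_maybe_quant_config` is a stand-in that takes an `is_fp8` flag (the one in the script is called without arguments), and the `quant_method` value is a placeholder. The only point it shows is that the quantization settings are attached once, to the top-level config, and never to a nested text config.

```python
from typing import Optional


def get_maybe_quant_config(is_fp8: bool) -> dict:
    """Hypothetical stand-in: quantization kwargs, or an empty dict if not quantized."""
    if not is_fp8:
        return {}
    # Placeholder values, not the real FP8 settings from the conversion script.
    return {"quantization_config": {"quant_method": "fp8"}}


def build_new_config(text_kwargs: dict, vision_kwargs: Optional[dict], is_fp8: bool) -> dict:
    """Hypothetical helper: build the top-level config and attach quantization only there."""
    # The text config is built without any quantization settings.
    new_text_config = dict(text_kwargs)

    if vision_kwargs is None:
        # No vision: the text config is itself the top-level config.
        new_config = new_text_config
    else:
        # With vision: the text config is nested under the top-level config.
        new_config = {
            "text_config": new_text_config,
            "vision_config": dict(vision_kwargs),
        }

    # Single place where quantization settings are added: the top-level config.
    new_config.update(get_maybe_quant_config(is_fp8))
    return new_config
```

With a vision config present, `build_new_config({"hidden_size": 8}, {"image_size": 4}, True)` puts `quantization_config` at the top level and leaves `text_config` untouched.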

@MekkCyber (Contributor) left a comment

Thank you

@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.
