Fix Step 3.5 Flash model conversion by kernelpool · Pull Request #840 · ml-explore/mlx-lm · GitHub

kernelpool · 2026-02-03T09:04:17Z

Fix to avoid applying the RMSNorm delta twice, at conversion and subsequently at load.
This simply reverts back to the original approach from b8c4549. Maybe theres a better way?

More info: #836 (comment)

awni · 2026-02-03T14:12:15Z

Yes you can just check if any of the original (un-mapped) keys are still in the weight keys. If they are it means it hasn't been converted yet to MLX format so it's safe to apply the +1.0

ghost · 2026-02-03T14:26:11Z

Hi Guys,just some feedback. This time I run the 8bit uploaded by myself and also the 4bit model uploaded by kernelpool under many different chat situation rather than the given test command.

It looks like they all have repetition problems. They might (almost 100% when the question is long) repeat certain words forever, the words is context related. Some times a whole short sentence is repeated.

It's not production ready for now. Don't know why, I'm just a test user, sorry.

kernelpool · 2026-02-03T20:19:56Z

@awni What do you think about the < 0.5 check? Otherwise we need to re-upload the existing models.

kernelpool · 2026-02-03T20:24:04Z

Hi Guys,just some feedback. This time I run the 8bit uploaded by myself and also the 4bit model uploaded by kernelpool under many different chat situation rather than the given test command.

It looks like they all have repetition problems. They might (almost 100% when the question is long) repeat certain words forever, the words is context related. Some times a whole short sentence is repeated.

It's not production ready for now. Don't know why, I'm just a test user, sorry.

What model parameters (temperature, etc) are you using?

awni · 2026-02-03T20:27:12Z

@awni What do you think about the < 0.5 check? Otherwise we need to re-upload the existing models.

I don't think we should do it that way. It's somewhat brittle to the mean of the weights and also breaks lazy loading to some extent.

I think we should just check the presence of a pattern in the weight keys to determine if it's already been converted. And I will re-uploading the models, that's not so difficult. (But others will have to reconvert or re-download).

ghost · 2026-02-03T23:34:33Z

What model parameters (temperature, etc) are you using?

I tried different temperatures, like 0.6, 1, they all behave the same way. top-p 0.95/1 also.

awni

Thanks!

awni · 2026-02-04T00:32:37Z

I will re-upload the models as soon as this lands.

ghost · 2026-02-04T01:46:48Z

Yup I confirm the old models are not working for the new commit.(Output nonsense again.) reupload required.

Fix Step 3.5 Flash model conversion

80fab60

kernelpool mentioned this pull request Feb 3, 2026

Add Step 3.5 Flash #836

Merged

Detect converted norm weights

166ebb9

kernelpool force-pushed the fix-step35-model branch from 8bbec69 to 166ebb9 Compare February 3, 2026 20:05

Check layer names

a476115

awni approved these changes Feb 4, 2026

View reviewed changes

awni merged commit b77ec6b into ml-explore:main Feb 4, 2026
2 checks passed

kernelpool mentioned this pull request Feb 4, 2026

Fix sliding window mask during generation #843

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix Step 3.5 Flash model conversion#840

Fix Step 3.5 Flash model conversion#840
awni merged 3 commits into
ml-explore:mainfrom
kernelpool:fix-step35-model

kernelpool commented Feb 3, 2026 •

edited

Loading

Uh oh!

awni commented Feb 3, 2026

Uh oh!

ghost commented Feb 3, 2026 •

edited by ghost

Loading

Uh oh!

kernelpool commented Feb 3, 2026

Uh oh!

kernelpool commented Feb 3, 2026

Uh oh!

awni commented Feb 3, 2026 •

edited

Loading

Uh oh!

ghost commented Feb 3, 2026 •

edited by ghost

Loading

Uh oh!

awni left a comment

Uh oh!

awni commented Feb 4, 2026

Uh oh!

Uh oh!

ghost commented Feb 4, 2026 •

edited by ghost

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kernelpool commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

awni commented Feb 3, 2026

Uh oh!

ghost commented Feb 3, 2026 • edited by ghost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kernelpool commented Feb 3, 2026

Uh oh!

kernelpool commented Feb 3, 2026

Uh oh!

awni commented Feb 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ghost commented Feb 3, 2026 • edited by ghost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

awni left a comment

Choose a reason for hiding this comment

Uh oh!

awni commented Feb 4, 2026

Uh oh!

Uh oh!

ghost commented Feb 4, 2026 • edited by ghost Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kernelpool commented Feb 3, 2026 •

edited

Loading

ghost commented Feb 3, 2026 •

edited by ghost

Loading

awni commented Feb 3, 2026 •

edited

Loading

ghost commented Feb 3, 2026 •

edited by ghost

Loading

ghost commented Feb 4, 2026 •

edited by ghost

Loading