Add K-EXAONE MoE by kernelpool · Pull Request #719 · ml-explore/mlx-lm · GitHub

kernelpool · 2026-01-03T09:07:51Z

Fix for issue #718 :)

Model: https://huggingface.co/LGAI-EXAONE/K-EXAONE-236B-A23B

Note

The probability of </think> gets significantly reduced on 4-bit (see table below), resulting in very long reasoning chains. I've tried to adjust the layer quantization, but wasn't able to find a silver bullet to the issue. I'd recommend sticking to 5-bit or above for now.

Metric	4-bit	5-bit	6-bit	8-bit
`</think>` probability	0.0395	0.6044	0.7539	0.7509
Top-1 token	`But`	`</think>`	`</think>`	`</think>`

Benchmark

Mac Studio M3 Ultra, 512 prompt tokens, 1024 generation tokens

Quantization	Prompt (tok/s)	Generation (tok/s)	Peak Memory (GB)
4-bit	343.56	37.41	134.01
6-bit	318.00	28.21	193.14
8-bit	305.80	23.61	252.24

Generation

mlx_lm.generate --model /Volumes/WD_EXTRA/models/catalyst/K-EXAONE-236B-A23B-6bit --prompt "Who is Albert Einstein?" -m 4096
The tokenizer you are loading from '/Volumes/WD_EXTRA/models/catalyst/K-EXAONE-236B-A23B-6bit' with an incorrect regex pattern: https://huggingface.co/mistralai/Mistral-Small-3.1-24B-Instruct-2503/discussions/84#69121093e8b480e709447d5e. This will lead to incorrect tokenization. You should set the `fix_mistral_regex=True` flag when loading this tokenizer to fix this issue.
==========
Okay, the user is asking, "Who is Albert Einstein?" Hmm, this seems like a straightforward question, but I should consider why they're asking. Maybe they're a student working on a school project, or someone curious about science history. Could also be a non-native English speaker checking basic knowledge.  

First, I need to cover the absolute essentials: his full name, lifespan, nationality, and field. But just listing facts won't be enough—they probably want to understand *why* he's famous. The E=mc² equation is iconic, but I should explain what it *means* in simple terms.  

Wait, should I mention his Nobel Prize? Yes, but clarify it wasn't for relativity—that's a common misconception. People often think he won for E=mc², but it was actually for the photoelectric effect. Important to correct that.  

Also, his personality matters. He wasn't just a lab scientist; he had a playful side (those wild hair, the pipe!), and he cared about social issues. The user might appreciate knowing he was human—flawed, with personal struggles.  

Oh, and the refugee angle! He fled Nazi Germany, which adds depth. Shows how politics affects science. Maybe the user is interested in historical context too.  

Should I address why he's *still* relevant? Like how relativity underpins GPS tech today? That makes it feel less like ancient history.  

...But don't overload them. Keep it structured:  
- Who he was (basic bio)  
- What he *did* (key contributions)  
- Why he matters (legacy)  
- Fun human details (to make him relatable)  

Also, avoid jargon. No "spacetime curvature" without explaining it's like gravity bending a trampoline. Wait—maybe save that for the relativity part.  

*Double-checking dates*: Born 1879, died 1955. Yep. Swiss, later German, then American. Got it.  

Nobel year: 1921, awarded 1922. Photoelectric effect paper was 1905—his "miracle year." Should highlight that year; it's wild he published four groundbreaking papers in one go.  

...And end with why he's a cultural icon. The "genius" stereotype, but also his humility ("I have no special talent..."). That quote might resonate.  

User probably doesn't need a bibliography, but if they follow up, I can suggest biographies. For now, keep it concise but vivid.  

*Pauses*  
Wait—did they mean *only* the science? Or the myth? Better cover both. The science is why he's important; the myth is why everyone *knows* him.  

Alright, structure:  
1. Quick intro (name, lifespan)  
2. Core achievements (relativity, E=mc², Nobel)  
3. Human side (personality, struggles)  
4. Legacy (science + culture)  

...And emphasize that relativity changed *everything*—not just physics, but how we see the universe. But don't get too technical.  

*Final check*: Am I missing why a regular person should care? Yes—add how his ideas affect daily life (GPS, nuclear energy). Makes it tangible.  

Okay, drafting now...
</think>

Albert Einstein (1879–1955) was a **German-born theoretical physicist** who revolutionized modern physics and became one of the most iconic scientists in history. Here's a concise yet comprehensive overview of who he was and why he matters:

---

### **Core Identity**
- **Full Name**: Albert Einstein  
- **Born**: March 14, 1879, in Ulm, German Empire (now Germany)  
- **Died**: April 18, 1955, in Princeton, New Jersey, USA  
- **Nationality**: Swiss (1896–1901), German (1901–1933), American (1940–1955)  
- **Field**: Theoretical physics, cosmology, and philosophy of science  

---

### **Key Contributions & Why He's Famous**
1. **Theory of Relativity**  
   - **Special Relativity (1905)**: Introduced the revolutionary idea that **space and time are interwoven** into a single continuum ("spacetime"). Key insights:  
     - The speed of light is constant for all observers.  
     - **E = mc²**: Mass and energy are equivalent (energy = mass × speed of light squared). This underpins nuclear energy, particle physics, and cosmology.  
   - **General Relativity (1915)**: Redefined gravity not as a "force" but as the **curvature of spacetime** caused by mass and energy. Predicted black holes, gravitational waves, and the expanding universe—later confirmed by observations.  

2. **Nobel Prize in Physics (1921)**  
   - Awarded **not for relativity** (which was still debated) but for his explanation of the **photoelectric effect**—proving light behaves as particles ("photons"). This work laid the foundation for **quantum mechanics**.  

3. **Other Major Work**  
   - **Brownian motion** (1905): Provided evidence for atoms' existence.  
   - **Bose-Einstein statistics** (1924): Described how certain particles behave at near-absolute zero (leading to Bose-Einstein condensates).  
   - **Unified field theory**: Spent his later years seeking a single framework to unify gravity and electromagnetism (unsuccessful, but inspired future research).  

---

### **Beyond Physics: His Humanity & Legacy**
- **Personality**: Known for **playful curiosity**, humility, and deep ethical concerns. Famously said:  
  > *"Imagination is more important than knowledge. Knowledge is limited. Imagination encircles the world."*  
  He was also a **civil rights advocate**, anti-war activist, and vocal critic of nationalism.  
- **Refugee from Nazism**: Fled Germany in 1933 (as Hitler rose to power) due to his Jewish heritage and pacifist views. Settled in the U.S., where he worked at the Institute for Advanced Study in Princeton.  
- **Cultural Icon**: His wild hair, thoughtful eyes, and gentle demeanor made him a global symbol of "genius." He used his fame to promote peace, education, and humanitarian causes.  
- **Controversies**: His personal life was complex (e.g., a strained relationship with his first wife, Mileva Marić, and rumored affairs).  

---

### **Why He Still Matters Today**
- **Science**: Relativity is essential for GPS technology, cosmology (e.g., understanding the Big Bang), and nuclear energy.  
- **Philosophy**: Challenged absolute notions of time, space, and reality—reshaping how humanity views the universe.  
- **Culture**: Embodies the power of curiosity, critical thinking, and ethical responsibility in science.  

> 💡 **Fun Fact**: In 1905—his "miracle year"—Einstein published **four groundbreaking papers** (on relativity, photoelectric effect, Brownian motion, and mass-energy equivalence) while working as a patent clerk.

Einstein wasn't just a scientist; he was a **visionary who redefined our cosmic perspective** and reminded us that science must serve humanity. His legacy endures in every GPS device, every discussion about black holes, and every child who looks up at the stars and wonders. 🌌
==========
Prompt: 13 tokens, 8.858 tokens-per-sec
Generation: 1556 tokens, 28.408 tokens-per-sec
Peak memory: 192.377 GB

# Conflicts: # tests/test_models.py

awni

Looks great!

kernelpool added 4 commits January 3, 2026 19:38

Add K-EXAONE MoE

2afcc80

Shard model

a760f30

Cleanup

89a2b01

Merge remote-tracking branch 'origin/main' into feature/exaone_moe

3da6e8a

# Conflicts: # tests/test_models.py

awni approved these changes Jan 5, 2026

View reviewed changes

Fix model test

4169b8f

awni merged commit 7a4d137 into ml-explore:main Jan 5, 2026
2 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add K-EXAONE MoE#719

Add K-EXAONE MoE#719
awni merged 5 commits into
ml-explore:mainfrom
kernelpool:feature/exaone_moe

kernelpool commented Jan 3, 2026 •

edited

Loading

Uh oh!

awni left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

kernelpool commented Jan 3, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Note

Benchmark

Generation

Uh oh!

awni left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

kernelpool commented Jan 3, 2026 •

edited

Loading