Showing posts from September, 2025

DeepSeek R1 paper came out

Great to see DeepSeek R1 published as a research paper, which lets people understand more about the engineering and design decisions made by their incredible team. According to a comment by the editor of Nature, this paper was reviewed by 8 experts. (Most scientific journals require only 3 independent reviewers.) This reminds me of AlphaFold, a revolutionary protein model that can fold most protein sequences into 3D structures with accuracy comparable to crystallography. AF was published as a scientific paper, followed by a Nobel Prize. However, a lot of details are missing from the AF paper, especially the training methods, which are crucial for fine-tuning the original model (if it is open sourced). OpenFold mimics AF as a fully open-source project with well-documented training methods, which allows scientists to understand the model in detail and appreciate the engineering marvel, in addition to being able to fine-tune the model. Love to see more publications of model details ...