The model must be autoregressive. It receives a token sequence as input and predicts the next token. Output digits are generated one at a time, with each new token fed back as input for predicting the next. The carry propagation must emerge from this autoregressive process — not from explicit state variables passed between steps in Python.
生成的实体类大致如下(已简化,关键部分):
,详情可参考搜狗输入法2026
30-day money-back guarantee
// Finally, we release the lock on the stream