Skeleton-of-Thought: Large Language Models Can Do Parallel Decoding
Paper • 2307.15337 • Published • 38
Think-at-Hard: Selective Latent Iterations to Improve Reasoning Language Models
Cache-to-Cache: Direct Semantic Communication Between Large Language Models