Scholars@Duke publication: SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models

SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models

Publication , Conference

Zhang, J; Juan, DC; Rashtchian, C; Ferng, CS; Jiang, H; Chen, Y

Published in: Advances in Neural Information Processing Systems

January 1, 2024

Large language models (LLMs) have demonstrated remarkable capabilities, but their outputs can sometimes be unreliable or factually incorrect. To address this, we introduce Self Logits Evolution Decoding (SLED), a novel decoding framework that enhances the truthfulness of LLMs without relying on external knowledge bases or requiring further fine-tuning. From an optimization perspective, our SLED framework leverages the latent knowledge embedded within the LLM by contrasting the output logits from the final layer with those from early layers. It then utilizes an approximate gradient approach to enable latent knowledge to guide the self-refinement of outputs, thereby effectively improving factual accuracy. Extensive experiments have been conducted on established benchmarks across a diverse range of model families (LLaMA 2, LLaMA 3, Gemma) and scales (from 2B to 70B), including more advanced architectural configurations such as the mixture of experts (MoE). Our evaluation spans a wide variety of tasks, including multi-choice, open-generation, and adaptations to chain-of-thought reasoning tasks. The results demonstrate that SLED consistently improves factual accuracy by up to 20% compared to existing decoding methods while maintaining natural language fluency and negligible latency overhead. Furthermore, it can be flexibly combined with other decoding methods to further enhance their performance.

Duke Scholars

Author Yiran Chen Electrical and Computer Engineering

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

Related Subject Headings

4611 Machine learning
1702 Cognitive Sciences
1701 Psychology

Citation

APA

Chicago

ICMJE

MLA

NLM

Zhang, J., Juan, D. C., Rashtchian, C., Ferng, C. S., Jiang, H., & Chen, Y. (2024). SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models. In Advances in Neural Information Processing Systems (Vol. 37).

Zhang, J., D. C. Juan, C. Rashtchian, C. S. Ferng, H. Jiang, and Y. Chen. “SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models.” In Advances in Neural Information Processing Systems, Vol. 37, 2024.

Zhang J, Juan DC, Rashtchian C, Ferng CS, Jiang H, Chen Y. SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models. In: Advances in Neural Information Processing Systems. 2024.

Zhang, J., et al. “SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models.” Advances in Neural Information Processing Systems, vol. 37, 2024.

Zhang J, Juan DC, Rashtchian C, Ferng CS, Jiang H, Chen Y. SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models. Advances in Neural Information Processing Systems. 2024.

Published In

Advances in Neural Information Processing Systems

ISSN

1049-5258

Publication Date

January 1, 2024

Volume

Related Subject Headings

4611 Machine learning
1702 Cognitive Sciences
1701 Psychology