faithfulness

Here are 5 public repositories matching this topic...

pkuserc / ChatGPT_for_IE

Evaluating ChatGPT’s Information Extraction Capabilities: An Assessment of Performance, Explainability, Calibration, and Faithfulness

performance evaluation information-extraction calibration named-entity-recognition event-detection event-extraction relation-extraction entity-typing relation-classification explainability large-language-models chatgpt faithfulness

Updated Nov 21, 2023
Python

MinhVuong2000 / LLMReasonCert

Star

Official Implementation of ACL2024 paper "Direct Evaluation of Chain-of-Thought in Multi-hop Reasoning with Knowledge Graphs"(https://arxiv.org/abs/2402.11199).

framework evaluation knowledge-graph reasoning evaluation-framework llms faithfulness

Updated Jul 27, 2024
Python

YisongMiao / DiSQ-Score

Star

The Dataset and Official Implementation for <Discursive Socratic Questioning: Evaluating the Faithfulness of Language Models’ Understanding of Discourse Relations> @ ACL 2024

evaluation discourse language-model faithfulness socratic-method