Task-Based Language Teaching

BenchING: A Benchmark for Evaluating Large Language Models in Following Structured Output Format Instruction in Text-Based Narrative Game Tasks

Abstract: In this article, we present BenchING, a new benchmark for evaluating large language models (LLMs) on their ability to follow structured output format instructions in text-based procedural ...

IEEE

A Comparative Analysis of Large Language Models with Retrieval-Augmented Generation based Question Answering System

Abstract: In recent studies, Large Language Models (LLMs) have shown remarkable effectiveness in a wide range of natural language processing tasks. However, their knowledge is limited to the data they ...
