KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks.

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal ...

Oct 9, 2024 · Based on this concept, we propose the Knowledge-Orthogonal Reasoning Benchmark (KOR-Bench), encompassing five task categories: Operation, Logic, ...

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal...

openreview.net › forum

Sep 27, 2024 · This paper presents KOR-Bench, a new benchmark that tests LLMs' reasoning abilities across five categories: Operation, Logic, Cipher, Puzzle, and ...

KOR-Bench

kor-bench.github.io

Knowledge-Orthogonal Reasoning Benchmark (KOR-Bench) is designed to evaluate models' intrinsic reasoning and planning abilities by minimizing interference ...

Ge Zhang on X: "[1/n] ### Exploring the Boundaries of AI Reasoning ...

twitter.com › GeZhang86038849 › status

Oct 18, 2024 · To more accurately assess large models' reasoning in new, unfamiliar areas, we're thrilled to introduce the all-new KOR-Bench (Knowledge-Orthogonal Reasoning ...

Haoran Zhang - Google Scholar

scholar.google.com › citations

Co-authors ; KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks. K Ma, X Du, Y Wang, H Zhang, Z Wen, X Qu, J Yang, J Liu, M Liu, X ...

Kaijing Ma - Google Scholar

scholar.google.com › citations

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks. K Ma, X Du, Y Wang, H Zhang, Z Wen, X Qu, J Yang, J Liu, M Liu, X Yue ...

2077AIDataFoundation (2077AI) - Hugging Face

huggingface.co › ...

OmniDocBench: Benchmarking Diverse PDF Document Parsing with ... KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks.

Haoran Zhang's research works - ResearchGate

www.researchgate.net › Haoran-Zhang-2...

In this paper, we introduce Knowledge-Orthogonal Reasoning (KOR), which minimizes the impact of domain-specific knowledge for a more accurate evaluation of ...

Antonio Montano 🪄 on LinkedIn: #machinelearning

www.linkedin.com › posts › montano_m...

Oct 18, 2024 · In this paper, we introduce Knowledge-Orthogonal Reasoning (KOR), which minimizes the impact of domain-specific knowledge for a more accurate evaluation of ...

ICLR 2025 Conference Submissions - OpenReview

openreview.net › submissions › Conferen...

KOR-Bench: Benchmarking Language Models on Knowledge-Orthogonal Reasoning Tasks · 27 Sept 2024 (modified: 26 Nov 2024) · ICLR 2025 Conference Submission · Readers: ...