Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving

Azarafza, Mehdi; Nayyeri, Mojtaba; Steinmetz, Charles; Staab, Steffen; Rettberg, Achim

Computer Science > Computer Vision and Pattern Recognition

arXiv:2402.13602 (cs)

[Submitted on 21 Feb 2024 (v1), last revised 19 Aug 2024 (this version, v4)]

Title:Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving

Authors:Mehdi Azarafza, Mojtaba Nayyeri, Charles Steinmetz, Steffen Staab, Achim Rettberg

View PDF HTML (experimental)

Abstract:Large Language Models (LLMs) have garnered significant attention for their ability to understand text and images, generate human-like text, and perform complex reasoning tasks. However, their ability to generalize this advanced reasoning with a combination of natural language text for decision-making in dynamic situations requires further exploration. In this study, we investigate how well LLMs can adapt and apply a combination of arithmetic and common-sense reasoning, particularly in autonomous driving scenarios. We hypothesize that LLMs hybrid reasoning abilities can improve autonomous driving by enabling them to analyze detected object and sensor data, understand driving regulations and physical laws, and offer additional context. This addresses complex scenarios, like decisions in low visibility (due to weather conditions), where traditional methods might fall short. We evaluated Large Language Models (LLMs) based on accuracy by comparing their answers with human-generated ground truth inside CARLA. The results showed that when a combination of images (detected objects) and sensor data is fed into the LLM, it can offer precise information for brake and throttle control in autonomous vehicles across various weather conditions. This formulation and answers can assist in decision-making for auto-pilot systems.

Comments:	12 pages, 5 figures
Subjects:	Computer Vision and Pattern Recognition (cs.CV); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2402.13602 [cs.CV]
	(or arXiv:2402.13602v4 [cs.CV] for this version)
	https://doi.org/10.48550/arXiv.2402.13602

Submission history

From: Mehdi Azarafza [view email]
[v1] Wed, 21 Feb 2024 08:09:05 UTC (4,054 KB)
[v2] Thu, 7 Mar 2024 12:24:11 UTC (4,053 KB)
[v3] Mon, 18 Mar 2024 09:50:00 UTC (4,222 KB)
[v4] Mon, 19 Aug 2024 13:27:55 UTC (4,561 KB)

Computer Science > Computer Vision and Pattern Recognition

Title:Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computer Vision and Pattern Recognition

Title:Hybrid Reasoning Based on Large Language Models for Autonomous Car Driving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators