<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE ArticleSet PUBLIC "-//NLM//DTD PubMed 2.7//EN" "https://dtd.nlm.nih.gov/ncbi/pubmed/in/PubMed.dtd">
<ArticleSet>
<Article>
<Journal>
				<PublisherName>Iranian Society of Cryptology</PublisherName>
				<JournalTitle>The ISC International Journal of Information Security</JournalTitle>
				<Issn>2008-2045</Issn>
				<Volume></Volume>
				<Issue>Articles in Press</Issue>
				<PubDate PubStatus="epublish">
					<Year>2026</Year>
					<Month>03</Month>
					<Day>12</Day>
				</PubDate>
			</Journal>
<ArticleTitle>A Multi-Objective Reinforcement Learning Framework for Security Enhancement in Autonomous Vehicle</ArticleTitle>
<VernacularTitle></VernacularTitle>
			<FirstPage></FirstPage>
			<LastPage></LastPage>
			<ELocationID EIdType="pii">242014</ELocationID>
			
<ELocationID EIdType="doi">10.22042/isecure.2026.242014</ELocationID>
			
			<Language>EN</Language>
<AuthorList>
<Author>
					<FirstName>Arman</FirstName>
					<LastName>Moradi</LastName>
<Affiliation>Department of Computer Engineering, Faculty of Engineering, University of Guilan</Affiliation>

</Author>
<Author>
					<FirstName>Mehran</FirstName>
					<LastName>Alidoost Nia</LastName>
<Affiliation>Faculty of Computer Science and Engineering, Shahid Beheshti University</Affiliation>

</Author>
<Author>
					<FirstName>Reza</FirstName>
					<LastName>Ebrahimi Atani</LastName>
<Affiliation>Department of Computer Engineering, Faculty of Engineering, University of Guilan</Affiliation>

</Author>
</AuthorList>
				<PublicationType>Journal Article</PublicationType>
		<Abstract>Autonomous vehicles must balance road-safety objectives with growing cybersecurity threats. In this paper, we present a reinforcement-learning framework that jointly optimizes driving performance and resilience to Denial-of-Service (DoS) attacks.The problem is formulated as a multi-objective Markov Decision Process that integrates a safety reward with a security reward, while the partial observability of attacks is captured via a Bayesian belief. A Proximal Policy Optimization (PPO) agent controls steering, throttle, and dedicated mitigation actions. The system is implemented in the CARLA simulator with camera and LiDAR inputs and evaluated on urban driving scenarios. Experimental results demonstrate that the agent sustains stable lane-keeping and target-speed performance, while substantially reducing collision-prone incidents and retaining more than 90 % of the nominal travel distance under attack scenarios. The framework outperforms the safety-only PPO baseline and a rule-based security countermeasure.</Abstract>
		<ObjectList>
			<Object Type="keyword">
			<Param Name="value">Security of Autonomous Vehicles</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Cyber-Physical Systems Security</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">safety</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Self-Driving Cars</Param>
			</Object>
			<Object Type="keyword">
			<Param Name="value">Autonomous systems</Param>
			</Object>
		</ObjectList>
<ArchiveCopySource DocType="pdf">https://www.isecure-journal.com/article_242014_f8a32ab0c18638d9013d22af3e94103b.pdf</ArchiveCopySource>
</Article>
</ArticleSet>
