Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system

Ma, Lianbo; Li, Nan; Guo, Yinan; Wang, Xingwei; Yang, Shengxiang; Huang, Min; Zhang, Hao

Sangam Home
→
Electronic Theses and Dissertations (ETDs)
→
De Montfort University Open Research Archive (DORA)
→
View Item

dc.creator	Ma, Lianbo
dc.creator	Li, Nan
dc.creator	Guo, Yinan
dc.creator	Wang, Xingwei
dc.creator	Yang, Shengxiang
dc.creator	Huang, Min
dc.creator	Zhang, Hao
dc.date	2021-06-15T14:55:16Z
dc.date	2021-06-15T14:55:16Z
dc.date	2021-05
dc.date	2021-06
dc.date.accessioned	2023-02-22T17:04:40Z
dc.date.available	2023-02-22T17:04:40Z
dc.identifier	Ma, L., Li, N., Guo, Y., Wang, X., Yang, S., Huang, M.and Zhang, H. (2021) Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system. IEEE Transactions on Cybernetics, in press.
dc.identifier	2168-2267
dc.identifier	https://dora.dmu.ac.uk/handle/2086/21001
dc.identifier	https://doi.org/10.1109/TCYB.2021.3086501
dc.identifier.uri	http://localhost:8080/xmlui/handle/CUHPOERS/254445
dc.description	The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link.
dc.description	The performance of decomposition-based algorithms is sensitive to the Pareto front shapes since their reference vectors preset in advance are not always adaptable to various problem characteristics with no a priori knowledge. For this issue, this paper proposes an adaptive reference vector reinforcement learning approach to decomposition-based algorithms for the industrial copper burdening optimization. The proposed approach involves two main operations, i.e., a reinforcement learning operation and a reference point sampling operation. Given the fact that the states of reference vectors interact with the landscape environment (quite often), the reinforcement learning operation treats the reference vector adaption process as a reinforcement learning task, where each reference vector learns from the environmental feedback and selects optimal actions for gradually fitting the problem characteristics. Accordingly, the reference point sampling operation uses estimation-of-distribution learning models to sample new reference points. Finally, the resultant algorithm is applied to handle the proposed industrial copper burdening problem. For this problem, an adaptive penalty function and a soft constraint-based relaxing approach are used to handle complex constraints. Experimental results on both benchmark problems and real-world instances verify the competitiveness and effectiveness of the proposed algorithm.
dc.format	application/pdf
dc.language	en_US
dc.publisher	IEEE Press
dc.subject	Many-objective optimization
dc.subject	Reference vector reinforcement learning
dc.subject	Copper burdening optimization
dc.title	Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system
dc.type	Article

Files in this item

Files	Size	Format	View
IEEETCYB21.pdf	3.583Mb	application/pdf	View/Open

This item appears in the following Collection(s)

De Montfort University Open Research Archive (DORA) [81]
De Montfort University (DMU)

Show simple item record

Search DSpace

Advanced Search

Browse

All of DSpace
This Collection
- By Issue Date
- Authors
- Titles
- Subjects

Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system

Files in this item

This item appears in the following Collection(s)

Search DSpace

Browse

All of DSpace

This Collection