Sangam: A Confluence of Knowledge Streams

Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system

Show simple item record

dc.creator Ma, Lianbo
dc.creator Li, Nan
dc.creator Guo, Yinan
dc.creator Wang, Xingwei
dc.creator Yang, Shengxiang
dc.creator Huang, Min
dc.creator Zhang, Hao
dc.date 2021-06-15T14:55:16Z
dc.date 2021-06-15T14:55:16Z
dc.date 2021-05
dc.date 2021-06
dc.date.accessioned 2023-02-22T17:04:40Z
dc.date.available 2023-02-22T17:04:40Z
dc.identifier Ma, L., Li, N., Guo, Y., Wang, X., Yang, S., Huang, M.and Zhang, H. (2021) Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system. IEEE Transactions on Cybernetics, in press.
dc.identifier 2168-2267
dc.identifier https://dora.dmu.ac.uk/handle/2086/21001
dc.identifier https://doi.org/10.1109/TCYB.2021.3086501
dc.identifier.uri http://localhost:8080/xmlui/handle/CUHPOERS/254445
dc.description The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link.
dc.description The performance of decomposition-based algorithms is sensitive to the Pareto front shapes since their reference vectors preset in advance are not always adaptable to various problem characteristics with no a priori knowledge. For this issue, this paper proposes an adaptive reference vector reinforcement learning approach to decomposition-based algorithms for the industrial copper burdening optimization. The proposed approach involves two main operations, i.e., a reinforcement learning operation and a reference point sampling operation. Given the fact that the states of reference vectors interact with the landscape environment (quite often), the reinforcement learning operation treats the reference vector adaption process as a reinforcement learning task, where each reference vector learns from the environmental feedback and selects optimal actions for gradually fitting the problem characteristics. Accordingly, the reference point sampling operation uses estimation-of-distribution learning models to sample new reference points. Finally, the resultant algorithm is applied to handle the proposed industrial copper burdening problem. For this problem, an adaptive penalty function and a soft constraint-based relaxing approach are used to handle complex constraints. Experimental results on both benchmark problems and real-world instances verify the competitiveness and effectiveness of the proposed algorithm.
dc.format application/pdf
dc.language en_US
dc.publisher IEEE Press
dc.subject Many-objective optimization
dc.subject Reference vector reinforcement learning
dc.subject Copper burdening optimization
dc.title Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system
dc.type Article


Files in this item

Files Size Format View
IEEETCYB21.pdf 3.583Mb application/pdf View/Open

This item appears in the following Collection(s)

Show simple item record

Search DSpace


Advanced Search

Browse