dc.creator |
Ma, Lianbo |
|
dc.creator |
Li, Nan |
|
dc.creator |
Guo, Yinan |
|
dc.creator |
Wang, Xingwei |
|
dc.creator |
Yang, Shengxiang |
|
dc.creator |
Huang, Min |
|
dc.creator |
Zhang, Hao |
|
dc.date |
2021-06-15T14:55:16Z |
|
dc.date |
2021-06-15T14:55:16Z |
|
dc.date |
2021-05 |
|
dc.date |
2021-06 |
|
dc.date.accessioned |
2023-02-22T17:04:40Z |
|
dc.date.available |
2023-02-22T17:04:40Z |
|
dc.identifier |
Ma, L., Li, N., Guo, Y., Wang, X., Yang, S., Huang, M.and Zhang, H. (2021) Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system. IEEE Transactions on Cybernetics, in press. |
|
dc.identifier |
2168-2267 |
|
dc.identifier |
https://dora.dmu.ac.uk/handle/2086/21001 |
|
dc.identifier |
https://doi.org/10.1109/TCYB.2021.3086501 |
|
dc.identifier.uri |
http://localhost:8080/xmlui/handle/CUHPOERS/254445 |
|
dc.description |
The file attached to this record is the author's final peer reviewed version. The Publisher's final version can be found by following the DOI link. |
|
dc.description |
The performance of decomposition-based algorithms is sensitive to the Pareto front shapes since their reference vectors preset in advance are not always adaptable to various problem characteristics with no a priori knowledge. For this issue, this paper proposes an adaptive reference vector reinforcement learning approach to decomposition-based algorithms for the industrial copper burdening optimization. The proposed approach involves two main operations, i.e., a reinforcement learning operation and a reference point sampling operation. Given the fact that the states of reference vectors interact with the landscape environment (quite often), the reinforcement learning operation treats the reference vector adaption process as a reinforcement learning task, where each reference vector learns from the environmental feedback and selects optimal actions for gradually fitting the problem characteristics. Accordingly, the reference point sampling operation uses estimation-of-distribution learning models to sample new reference points. Finally, the resultant algorithm is applied to handle the proposed industrial copper burdening problem. For this problem, an adaptive penalty function and a soft constraint-based relaxing approach are used to handle complex constraints. Experimental results on both benchmark problems and real-world instances verify the competitiveness and effectiveness of the proposed algorithm. |
|
dc.format |
application/pdf |
|
dc.language |
en_US |
|
dc.publisher |
IEEE Press |
|
dc.subject |
Many-objective optimization |
|
dc.subject |
Reference vector reinforcement learning |
|
dc.subject |
Copper burdening optimization |
|
dc.title |
Learning to optimize: Reference vector reinforcement learning adaption to constrained many-objective optimization of industrial copper burdening system |
|
dc.type |
Article |
|