Reinforcement Learning for logistics

Context

Many processes in logiscs are managed manually and solved empirically or through the use of heuristics. In the area of AI, many of the processes could be automized/improved through AI algorithms. In particualr we are interested in exploring the use of Reinforcement Learning (RL) algorithms to fullfil such purpose.

Project proposal

Using RL is interesting for two different reasosns; On the one hand, the many RL algorithms (PPO, TRPO, A3C) open a wide range the possibilities and versatility as candidates to find a solution for logistics problems. On the other hand, the different extensions of RL (and in particular multi-objective and multi-agent) can be combined to propose holistic interaction and learned behavior of complex tasks.

The project will be developed to solve the problem of pick-up and delivery optimization within a warehouse under different demand parameters, packaging restrictions, and external delivery. The objective of the developed agent will be to maximize the throughput of warehouse, while minimizing the resources used (time, distance covered by workers, redundancy of trips).

The validation plan of the thesis will take place using real-world datasets.

Implementation plan

The implementation of this work will be based on the development of DQN-based algorithms to find a solutiion for the problem

Background and Literature

Contact

n.cardozo


Universidad de los Andes | Vigilada Mineducación
Reconocimiento como Universidad: Decreto 1297 del 30 de mayo de 1964.
Reconocimiento personería jurídica: Resolución 28 del 23 de febrero de 1949 Minjusticia
Edificio Mario Laserna Cra 1Este No 19A - 40 Bogotá (Colombia) | Tel: [571] 3394949 Ext: 2860, 2861, 2862 | Fax: [571] 3324325
© 2017-2025 - Departamento de Ingeniería de Sistemas y Computación