skip to main content
Skip header Section
Dynamic Programming and Optimal ControlNovember 2000
Publisher:
  • Athena Scientific
ISBN:978-1-886529-09-0
Published:01 November 2000
Pages:
520
Skip Bibliometrics Section
Reflects downloads up to 24 Dec 2024Bibliometrics
Abstract

No abstract available.

Cited By

  1. He X, You C and Quek T (2024). Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach, IEEE Transactions on Mobile Computing, 23:10, (9881-9897), Online publication date: 1-Oct-2024.
  2. Barde S (2024). Efficient opportunistic maintenance strategies via pruning in parallel–series systems with economic dependence, Computers and Industrial Engineering, 196:C, Online publication date: 1-Oct-2024.
  3. Leleux P, Lebichot B, Guex G and Saerens M (2024). Sparse randomized policies for Markov decision processes based on Tsallis divergence regularization, Knowledge-Based Systems, 300:C, Online publication date: 27-Sep-2024.
  4. Khairy S and Balaprakash P (2024). Multi-fidelity reinforcement learning with control variates, Neurocomputing, 597:C, Online publication date: 7-Sep-2024.
  5. ACM
    Sridhar H, Huang G, Thorpe A, Oishi M and Pitts B (2024). Characterizing the Effect of Mind Wandering on Braking Dynamics in Partially Autonomous Vehicles, ACM Transactions on Cyber-Physical Systems, 8:3, (1-21), Online publication date: 31-Jul-2024.
  6. Xiong W, Liu Q, Li F, Wang B and Zhu F (2024). Personalized federated reinforcement learning, Expert Systems with Applications: An International Journal, 238:PF, Online publication date: 15-Mar-2024.
  7. Tsur D, Aharoni Z, Goldfeld Z and Permuter H (2024). Data-Driven Optimization of Directed Information Over Discrete Alphabets, IEEE Transactions on Information Theory, 70:3, (1652-1670), Online publication date: 1-Mar-2024.
  8. Zacharias C, Liu N and Begen M (2024). Dynamic Interday and Intraday Scheduling, Operations Research, 72:1, (317-335), Online publication date: 1-Jan-2024.
  9. Rozada S, Paternain S and Marques A (2024). Tensor and Matrix Low-Rank Value-Function Approximation in Reinforcement Learning, IEEE Transactions on Signal Processing, 72, (1634-1649), Online publication date: 1-Jan-2024.
  10. Kalogiannis F and Panageas I Zero-sum polymatrix Markov games Proceedings of the 37th International Conference on Neural Information Processing Systems, (59996-60020)
  11. Chen W, Banerjee T, George J and Busart C Reinforcement Learning with an Abrupt Model Change Proceedings of the Winter Simulation Conference, (3014-3025)
  12. ACM
    Pan J, Sun Y and Shroff N (2023). Sampling for Remote Estimation of the Wiener Process over an Unreliable Channel, Proceedings of the ACM on Measurement and Analysis of Computing Systems, 7:3, (1-41), Online publication date: 7-Dec-2023.
  13. ACM
    Xu R, Bhandari J, Korenkevych D, Liu F, He Y, Nikulkov A and Zhu Z Optimizing Long-term Value for Auction-Based Recommender Systems via On-Policy Reinforcement Learning Proceedings of the 17th ACM Conference on Recommender Systems, (955-962)
  14. ACM
    Grandia R, Farshidian F, Knoop E, Schumacher C, Hutter M and Bächer M (2023). DOC: Differentiable Optimal Control for Retargeting Motions onto Legged Robots, ACM Transactions on Graphics, 42:4, (1-14), Online publication date: 1-Aug-2023.
  15. Xing W, Zhao X, Başar T and Xia W (2023). Optimal transmission scheduling for remote state estimation in CPSs with energy harvesting two-hop relay networks, Automatica (Journal of IFAC), 152:C, Online publication date: 1-Jun-2023.
  16. Dong J, Shen L, Xu Y and Wang B Provably Efficient Convergence of Primal-Dual Actor-Critic with Nonlinear Function Approximation Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, (2640-2642)
  17. Bennouna A, Joseph J, Nze-Ndong D, Perakis G, Singhvi D, Lami O, Spantidakis Y, Thayaparan L and Tsiourvas A (2023). COVID-19, Manufacturing & Service Operations Management, 25:3, (1013-1032), Online publication date: 1-May-2023.
  18. Lin C, Shang K and Sun P (2023). Wait Time–Based Pricing for Queues with Customer-Chosen Service Times, Management Science, 69:4, (2127-2146), Online publication date: 1-Apr-2023.
  19. Crispino G, Freire V and Delgado K (2023). GUBS criterion, Artificial Intelligence, 316:C, Online publication date: 1-Mar-2023.
  20. Mandal D, Radanović G, Gan J, Singla A and Majumdar R Online reinforcement learning with uncertain episode lengths Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, (9064-9071)
  21. Bu J, Gong X and Chao X (2023). Asymptotic Optimality of Base-Stock Policies for Perishable Inventory Systems, Management Science, 69:2, (846-864), Online publication date: 1-Feb-2023.
  22. Li L and Fu J Topological Approximate Dynamic Programming under Temporal Logic Constraints 2019 IEEE 58th Conference on Decision and Control (CDC), (5330-5337)
  23. Tsukamoto H and Chung S Convex Optimization-based Controller Design for Stochastic Nonlinear Systems using Contraction Analysis 2019 IEEE 58th Conference on Decision and Control (CDC), (8196-8203)
  24. Peng Y and Zhang G Thompson Sampling Meets Ranking and Selection Proceedings of the Winter Simulation Conference, (3075-3086)
  25. Qin K, Hong L and Fan W Non-Myopic Knowledge Gradient Policy for Ranking and Selection Proceedings of the Winter Simulation Conference, (3051-3062)
  26. Papadigenopoulos O, Caramanis C and Shakkottai S Non-stationary bandits under recharging payoffs Proceedings of the 36th International Conference on Neural Information Processing Systems, (20325-20337)
  27. Lin Y, Ren Y and Zhou E Bayesian risk Markov decision processes Proceedings of the 36th International Conference on Neural Information Processing Systems, (17430-17442)
  28. Bura A, Zonuzy A, Kalathil D, Shakkottai S and Chamberland J DOPE Proceedings of the 36th International Conference on Neural Information Processing Systems, (1047-1059)
  29. ACM
    Peng C and Mitra U Decentralized Scheduling of a Cognitive Multihop Underwater Acoustic Network with Interference Constraint Proceedings of the 16th International Conference on Underwater Networks & Systems, (1-8)
  30. ACM
    Zhang Q, Wei H, Wang W and Ying L On low-complexity quickest intervention of mutated diffusion processes through local approximation Proceedings of the Twenty-Third International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, (141-150)
  31. Shemuel E, Sabag O and Permuter H (2022). The Feedback Capacity of Noisy Output Is the STate (NOST) Channels, IEEE Transactions on Information Theory, 68:8, (5044-5059), Online publication date: 1-Aug-2022.
  32. ACM
    Puranik B, Madhow U and Pedarsani R A Dynamic Decision-Making Framework Promoting Long-Term Fairness Proceedings of the 2022 AAAI/ACM Conference on AI, Ethics, and Society, (547-556)
  33. Tsur D, Aharoni Z, Goldfeld Z and Permuter H Optimizing Estimated Directed Information over Discrete Alphabets 2022 IEEE International Symposium on Information Theory (ISIT), (2898-2903)
  34. Keeler J, Linder T and Yüksel S An Asymptotically Optimal Two-Part Coding Scheme for Networked Control under Fixed-Rate Constraints 2022 IEEE International Symposium on Information Theory (ISIT), (1360-1365)
  35. Hosseinloo A and Dahleh M (2021). Deterministic policy gradient algorithms for semi‐Markov decision processes, International Journal of Intelligent Systems, 37:7, (4008-4019), Online publication date: 26-May-2022.
  36. Agarwal M, Aggarwal V and Lan T Multi-Objective Reinforcement Learning with Non-Linear Scalarization Proceedings of the 21st International Conference on Autonomous Agents and Multiagent Systems, (9-17)
  37. He X, Wang S, Wang X, Xu S and Ren J Age-Based Scheduling for Monitoring and Control Applications in Mobile Edge Computing Systems IEEE INFOCOM 2022 - IEEE Conference on Computer Communications, (1009-1018)
  38. ACM
    Yilmaz E, Ji T, Ayday E and Li P Genomic Data Sharing under Dependent Local Differential Privacy Proceedings of the Twelfth ACM Conference on Data and Application Security and Privacy, (77-88)
  39. K.S. A, Singh C, Maguluri S and Parag P (2022). Optimal pricing in multi server systems, Performance Evaluation, 154:C, Online publication date: 1-Apr-2022.
  40. Yu H, Shang J and Chen T (2022). Stochastic event-based LQG control, Automatica (Journal of IFAC), 138:C, Online publication date: 1-Apr-2022.
  41. ACM
    Luo Y, Gupta V and Kolar M (2022). Dynamic Regret Minimization for Control of Non-stationary Linear Dynamical Systems, Proceedings of the ACM on Measurement and Analysis of Computing Systems, 6:1, (1-72), Online publication date: 24-Feb-2022.
  42. Jiang H and Zhou B (2021). Bias-policy iteration based adaptive dynamic programming for unknown continuous-time linear systems, Automatica (Journal of IFAC), 136:C, Online publication date: 1-Feb-2022.
  43. Liu M, Pedrielli G and Cao Y Partitioning and gaussian processes for accelerating sampling in Monte Carlo tree search for continuous decisions Proceedings of the Winter Simulation Conference, (1-13)
  44. Wang B, Yan Y and Fan J Sample-efficient reinforcement learning for linearly-parameterized MDPs with a generative model Proceedings of the 35th International Conference on Neural Information Processing Systems, (23009-23022)
  45. Sassano M and Astolfi A (2021). Constructive design of open-loop Nash equilibrium strategies that admit a feedback synthesis in LQ games, Automatica (Journal of IFAC), 133:C, Online publication date: 1-Nov-2021.
  46. Lin S, Lai Y, Huang Y, Wang C and Wang I Optimal finite-length linear codes and the corresponding channel dispersion for broadcast packet erasure channels with feedback 2021 IEEE Information Theory Workshop (ITW), (1-6)
  47. Ma A, Ouimet M and Cortés J (2021). Temporal sampling annealing schemes for receding horizon multi-agent planning, Robotics and Autonomous Systems, 143:C, Online publication date: 1-Sep-2021.
  48. ACM
    Chelmis C and Zois D (2021). Dynamic, Incremental, and Continuous Detection of Cyberbullying in Online Social Media, ACM Transactions on the Web, 15:3, (1-33), Online publication date: 31-Aug-2021.
  49. ACM
    Eliyahu T, Kazak Y, Katz G and Schapira M Verifying learning-augmented systems Proceedings of the 2021 ACM SIGCOMM 2021 Conference, (305-318)
  50. ACM
    Li K, Lu N, Zheng J, Zhang P, Ni W and Tovar E (2021). BloothAir, ACM Transactions on Cyber-Physical Systems, 5:3, (1-22), Online publication date: 31-Jul-2021.
  51. ACM
    Yao G, Bedewy A and Shroff N Battle between Rate and Error in Minimizing Age of Information Proceedings of the Twenty-second International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, (121-130)
  52. ACM
    Pan J, Bedewy A, Sun Y and Shroff N Minimizing Age of Information via Scheduling over Heterogeneous Channels Proceedings of the Twenty-second International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, (111-120)
  53. ACM
    Zou Y, Kim K, Lin X and Chiang M Minimizing Age-of-Information in Heterogeneous Multi-Channel Systems Proceedings of the Twenty-second International Symposium on Theory, Algorithmic Foundations, and Protocol Design for Mobile Networks and Mobile Computing, (11-20)
  54. Liu J (2021). On the convergence of reinforcement learning with Monte Carlo Exploring Starts, Automatica (Journal of IFAC), 129:C, Online publication date: 1-Jul-2021.
  55. Dębski R and Sniezynski B Pruned Simulation-Based Optimal Sailboat Path Search Using Micro HPC Systems Computational Science – ICCS 2021, (158-172)
  56. Liao J, Liu C and Liu H Model Predictive Control for Cooperative Hunting in Obstacle Rich and Dynamic Environments 2021 IEEE International Conference on Robotics and Automation (ICRA), (5089-5095)
  57. ACM
    Majumdar R and Soudjani S The computability of LQR and LQG control Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control, (1-7)
  58. Wang H, Lin S, Jafarkhani H and Zhang J Distributed Q-Learning with State Tracking for Multi-agent Networked Control Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems, (1692-1694)
  59. Congeduti E, Mey A and Oliehoek F Loss Bounds for Approximate Influence-Based Abstraction Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems, (377-385)
  60. Choudhury S, Gupta J, Morales P and Kochenderfer M Scalable Anytime Planning for Multi-Agent MDPs Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems, (341-349)
  61. Guan T and Frey C Predictive energy efficiency optimization of an electric vehicle using information about traffic light sequences and other vehicles 2016 IEEE 19th International Conference on Intelligent Transportation Systems (ITSC), (919-926)
  62. Khina A, Pettersson G, Kostina V and Hassibi B Multi-rate control over AWGN channels via analog joint source-channel coding 2016 IEEE 55th Conference on Decision and Control (CDC), (5968-5973)
  63. Sasabe M and Hara T (2021). Capacitated Shortest Path Tour Problem-Based Integer Linear Programming for Service Chaining and Function Placement in NFV Networks, IEEE Transactions on Network and Service Management, 18:1, (104-117), Online publication date: 1-Mar-2021.
  64. Scampicchio A and Pillonetto G A convex approach to robust LQR 2020 59th IEEE Conference on Decision and Control (CDC), (3705-3710)
  65. Jain A and Morari M Computing the racing line using Bayesian optimization 2020 59th IEEE Conference on Decision and Control (CDC), (6192-6197)
  66. Shah D, Song D, Xu Z and Yang Y Sample efficient reinforcement learning via low-rank matrix estimation Proceedings of the 34th International Conference on Neural Information Processing Systems, (12092-12103)
  67. Tang Z, Feng Y, Zhang N, Peng J and Liu Q Off-policy interval estimation with lipschitz value iteration Proceedings of the 34th International Conference on Neural Information Processing Systems, (7887-7897)
  68. Plevrakis O and Hazan E Geometric exploration for online control Proceedings of the 34th International Conference on Neural Information Processing Systems, (7637-7647)
  69. Liu D, Liu Y, Xing Y, Ghosh S and Kapila V DDP-based Parachute Landing Optimization for a Humanoid 2020 IEEE International Symposium on Safety, Security, and Rescue Robotics (SSRR), (122-128)
  70. ACM
    Yang H, Liu X, Zhong S and Walid A Deep reinforcement learning for automated stock trading Proceedings of the First ACM International Conference on AI in Finance, (1-8)
  71. Maatouk A, Kriouile S, Assaad M and Ephremides A (2020). The Age of Incorrect Information: A New Performance Metric for Status Updates, IEEE/ACM Transactions on Networking, 28:5, (2215-2228), Online publication date: 1-Oct-2020.
  72. ACM
    Li K, Ni W, Emami Y, Shen Y, Severino R, Pereira D and Tovar E (2019). Design and Implementation of Secret Key Agreement for Platoon-based Vehicular Cyber-physical Systems, ACM Transactions on Cyber-Physical Systems, 4:2, (1-20), Online publication date: 30-Apr-2020.
  73. Liang M, Wang D and Liu D (2020). Improved value iteration for neural-network-based stochastic optimal control design, Neural Networks, 124:C, (280-295), Online publication date: 1-Apr-2020.
  74. Mandal J, Goswami A, Wang J and Tiwari M (2020). Optimization of vehicle speed for batches to minimize supply chain cost under uncertain demand, Information Sciences: an International Journal, 515:C, (26-43), Online publication date: 1-Apr-2020.
  75. Possieri C, Sassano M, Galeani S and Teel A (2020). The linear quadratic regulator for periodic hybrid systems, Automatica (Journal of IFAC), 113:C, Online publication date: 1-Mar-2020.
  76. Kárný M (2020). Fully probabilistic design unifies and supports dynamic decision making under uncertainty, Information Sciences: an International Journal, 509:C, (104-118), Online publication date: 1-Jan-2020.
  77. Tout H, Kara N, Talhi C and Mourad A (2022). Proactive machine learning-based solution for advanced manageability of multi-persona mobile computing, Computers and Electrical Engineering, 80:C, Online publication date: 1-Dec-2019.
  78. ACM
    Kiennert C, Ismail Z, Debar H and Leneutre J (2018). A Survey on Game-Theoretic Approaches for Intrusion Detection and Response Optimization, ACM Computing Surveys, 51:5, (1-31), Online publication date: 30-Sep-2019.
  79. Sharma H and Jain R An Approximately Optimal Relative Value Learning Algorithm for Averaged MDPs with Continuous States and Actions 2019 57th Annual Allerton Conference on Communication, Control, and Computing (Allerton), (734-740)
  80. Wu K, Abolfazli Esfahani M, Yuan S and Wang H (2019). TDPP-Net, Neurocomputing, 357:C, (151-162), Online publication date: 10-Sep-2019.
  81. Huang G and Tu W (2019). A high-throughput wireless-powered relay network with joint time and power allocations, Computer Networks: The International Journal of Computer and Telecommunications Networking, 160:C, (65-76), Online publication date: 4-Sep-2019.
  82. Lin S, Meng N and Li W Optimizing constraint solving via dynamic programming Proceedings of the 28th International Joint Conference on Artificial Intelligence, (1146-1154)
  83. Mönnigmann M (2019). On the structure of the set of active sets in constrained linear quadratic regulation, Automatica (Journal of IFAC), 106:C, (61-69), Online publication date: 1-Aug-2019.
  84. ACM
    Qin Z, Tang J and Ye J Deep Reinforcement Learning with Applications in Transportation Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, (3201-3202)
  85. Cooper B and Cowlagi R (2019). Interactive planning and sensing in unknown static environments with task-driven sensor placement, Automatica (Journal of IFAC), 105:C, (391-398), Online publication date: 1-Jul-2019.
  86. Parras J and Zazo S (2019). Learning attack mechanisms in Wireless Sensor Networks using Markov Decision Processes, Expert Systems with Applications: An International Journal, 122:C, (376-387), Online publication date: 15-May-2019.
  87. ACM
    Yao M, Chelmis C and Zois D Cyberbullying Ends Here: Towards Robust Detection of Cyberbullying in Social Media The World Wide Web Conference, (3427-3433)
  88. Liu Z and Wu H (2019). New insight into the simultaneous policy update algorithms related to H ∞ state feedback control, Information Sciences: an International Journal, 484:C, (84-94), Online publication date: 1-May-2019.
  89. Burra R, Singh C and Kuri J Service Scheduling for Bernoulli Requests and Quadratic Cost IEEE INFOCOM 2019 - IEEE Conference on Computer Communications, (2584-2592)
  90. Balseiro S and Brown D (2019). Approximations to Stochastic Dynamic Programs via Information Relaxation Duality, Operations Research, 67:2, (577-597), Online publication date: 1-Mar-2019.
  91. Li Y, Mehr A and Chen T (2019). Multi-sensor transmission power control for remote estimation through a SINR-based communication channel, Automatica (Journal of IFAC), 101:C, (78-86), Online publication date: 1-Mar-2019.
  92. Vaton S, Brun O, Mouchet M, Belzarena P, Amigo I, Prabhu B and Chonavel T (2019). Joint Minimization of Monitoring Cost and Delay in Overlay Networks, Journal of Network and Systems Management, 27:1, (188-232), Online publication date: 1-Jan-2019.
  93. Yin S and Tsumura K The Second Law of Controlled Linear Stochastic Thermodynamic Systems over a Noiseless Digital Channel 2018 IEEE Conference on Decision and Control (CDC), (2561-2566)
  94. Pike C, Novak A, Moran B, Kirszenblat D and Hill B A stochastic programming approach to optimal recruitment in australian naval aviation training Proceedings of the 2018 Winter Simulation Conference, (3753-3764)
  95. Lv B, Wang R, Cui Y and Tan H Joint Optimization of File Placement and Delivery in Cache-Assisted Wireless Networks 2018 IEEE Global Communications Conference (GLOBECOM), (1-7)
  96. Biswas S, Knorn S, Dey S and Ahlen A Quantized Non-Bayesian Quickest Change Detection with Energy Harvesting 2018 IEEE Global Communications Conference (GLOBECOM), (1-7)
  97. Liu T, Xun J, Yin J and Xiao X Optimal Train Control by Approximate Dynamic Programming: Comparison of Three Value Function Approximation Methods* 2018 21st International Conference on Intelligent Transportation Systems (ITSC), (2741-2746)
  98. ACM
    Grammatopoulou M, Kanellopoulos A and Vamvoudakis K A multi-step and resilient predictive Q-learning algorithm for IoT Proceedings of the 8th International Conference on the Internet of Things, (1-8)
  99. Nisioti E and Thomos N Decentralized Reinforcement Learning Based MAC Optimization 2018 IEEE 29th Annual International Symposium on Personal, Indoor and Mobile Radio Communications (PIMRC), (1-5)
  100. Imani M and Braga-Neto U (2018). Finite-horizon LQR controller for partially-observed Boolean dynamical systems, Automatica (Journal of IFAC), 95:C, (172-179), Online publication date: 1-Sep-2018.
  101. Sehr M and Bitmead R (2022). Stochastic output-feedback model predictive control, Automatica (Journal of IFAC), 94:C, (315-323), Online publication date: 1-Aug-2018.
  102. Rattaro C and Belzarena P (2018). Cognitive Radio Networks, Wireless Personal Communications: An International Journal, 101:4, (2053-2083), Online publication date: 1-Aug-2018.
  103. Anagnostopoulos C and Kolomvatsos K (2018). Predictive intelligence to the edge through approximate collaborative context reasoning, Applied Intelligence, 48:4, (966-991), Online publication date: 1-Apr-2018.
  104. Cadena J, Basak A, Vullikanti A and Deng X Graph scan statistics with uncertainty Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence and Thirtieth Innovative Applications of Artificial Intelligence Conference and Eighth AAAI Symposium on Educational Advances in Artificial Intelligence, (2771-2778)
  105. ACM
    Kim J, Tabibian B, Oh A, Schölkopf B and Gomez-Rodriguez M Leveraging the Crowd to Detect and Reduce the Spread of Fake News and Misinformation Proceedings of the Eleventh ACM International Conference on Web Search and Data Mining, (324-332)
  106. Khina A, Nakahira Y, Su Y and Hassibi B Algorithms for optimal control with fixed-rate feedback 2017 IEEE 56th Annual Conference on Decision and Control (CDC), (6015-6020)
  107. Škach J, Straka O and Punčochář I Efficient active fault diagnosis using adaptive particle filter 2017 IEEE 56th Annual Conference on Decision and Control (CDC), (5732-5738)
  108. Hung S, Zhang X, Festag A, Chen K and Fettweis G Virtual Cells and Virtual Networks Enablelow-Latency Vehicle-to-Vehicle Communication GLOBECOM 2017 - 2017 IEEE Global Communications Conference, (1-7)
  109. Zafeiriou L, Panagakis Y, Pantic M and Zafeiriou S (2017). Nonnegative Decompositions for Dynamic Visual Data Analysis, IEEE Transactions on Image Processing, 26:12, (5603-5617), Online publication date: 1-Dec-2017.
  110. ACM
    Ghosh A, Chattopadhyay A, Arora A and Kumar A (2017). Measurement Based As-You-Go Deployment of Two-Connected Wireless Relay Networks, ACM Transactions on Sensor Networks, 13:3, (1-23), Online publication date: 31-Aug-2017.
  111. ACM
    Anagnostopoulos C and Triantafillou P (2017). Query-Driven Learning for Predictive Analytics of Data Subspace Cardinality, ACM Transactions on Knowledge Discovery from Data, 11:4, (1-46), Online publication date: 21-Aug-2017.
  112. Azar M, Osband I and Munos R Minimax regret bounds for reinforcement learning Proceedings of the 34th International Conference on Machine Learning - Volume 70, (263-272)
  113. Yazidi A and John Oommen B (2017). A novel technique for stochastic root-finding, Information Sciences: an International Journal, 393:C, (108-129), Online publication date: 1-Jul-2017.
  114. Guan T and Frey C Improvement of predictive energy efficiency optimization using long distance horizon estimation 2017 IEEE Intelligent Vehicles Symposium (IV), (1249-1255)
  115. Butkova Y, Wimmer R and Hermanns H Long-Run Rewards for Markov Automata Proceedings, Part II, of the 23rd International Conference on Tools and Algorithms for the Construction and Analysis of Systems - Volume 10206, (188-203)
  116. Cowlagi R (2017). Hierarchical trajectory optimization for a class of hybrid dynamical systems, Automatica (Journal of IFAC), 77:C, (112-119), Online publication date: 1-Mar-2017.
  117. ACM
    Feinberg E and Liang Y (2016). Structure of Optimal Solutions to Periodic-Review Total-Cost Stochastic Inventory Control Problems, ACM SIGMETRICS Performance Evaluation Review, 44:2, (21-23), Online publication date: 29-Sep-2016.
  118. Becerra I, Valentín-Coronado L, Murrieta-Cid R and Latombe J (2016). Reliable confirmation of an object identity by a mobile robot, International Journal of Robotics Research, 35:10, (1207-1233), Online publication date: 1-Sep-2016.
  119. Yoo O, Corbett C and Roels G (2016). Optimal Time Allocation for Process Improvement for Growth-Focused Entrepreneurs, Manufacturing & Service Operations Management, 18:3, (361-375), Online publication date: 1-Jul-2016.
  120. Xin Y, Shayman M, La R and Marcus S (2016). Reconfiguration of survivable IP over WDM networks, Optical Switching and Networking, 21:C, (93-100), Online publication date: 1-Jul-2016.
  121. Murat A, Laporte G and Verter V (2016). A global shooting algorithm for the facility location and capacity acquisition problem on a line with dense demand, Computers and Operations Research, 71:C, (1-15), Online publication date: 1-Jul-2016.
  122. Master N, Dua A, Tsamis D, Singh J and Bambos N (2016). Adaptive Prefetching in Wireless Computing, IEEE Transactions on Wireless Communications, 15:5, (3296-3310), Online publication date: 1-May-2016.
  123. Shi C, Chen W and Duenyas I (2016). Technical Note—Nonparametric Data-Driven Algorithms for Multiproduct Inventory Systems with Censored Demand, Operations Research, 64:2, (362-370), Online publication date: 1-Apr-2016.
  124. Zhang W, Moustakides G and Poor H (2016). Opportunistic Detection Rules: Finite and Asymptotic Analysis, IEEE Transactions on Information Theory, 62:4, (2140-2152), Online publication date: 1-Apr-2016.
  125. Chatterjee K, Chmelík M and Davies J A symbolic SAT-based algorithm for almost-sure reachability with small strategies in POMDPs Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, (3225-3232)
  126. Farahmand A, Nikovski D, Igarashi Y and Konaka H Truncated approximate dynamic programming with task-dependent terminal value Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, (3123-3129)
  127. ACM
    Yin X, Jindal A, Sekar V and Sinopoli B (2015). A Control-Theoretic Approach for Dynamic Adaptive Video Streaming over HTTP, ACM SIGCOMM Computer Communication Review, 45:4, (325-338), Online publication date: 22-Sep-2015.
  128. ACM
    Yin X, Jindal A, Sekar V and Sinopoli B A Control-Theoretic Approach for Dynamic Adaptive Video Streaming over HTTP Proceedings of the 2015 ACM Conference on Special Interest Group on Data Communication, (325-338)
  129. ACM
    Bai A, Wu F and Chen X (2015). Online Planning for Large Markov Decision Processes with Hierarchical Decomposition, ACM Transactions on Intelligent Systems and Technology, 6:4, (1-28), Online publication date: 13-Aug-2015.
  130. Kratochvil V and Vomlel J Influence diagrams for the optimization of a vehicle speed profile Proceedings of the Twelfth UAI Conference on Bayesian Modeling Applications Workshop - Volume 1565, (44-53)
  131. ACM
    Naveen K and Kumar A (2015). Relay Selection with Channel Probing in Sleep-Wake Cycling Wireless Sensor Networks, ACM Transactions on Sensor Networks, 11:3, (1-38), Online publication date: 28-May-2015.
  132. ACM
    Jain M, Khadilkar H, Sengupta N, Charbiwala Z, Tennakoon K, Wahab R, De Silva L and Seetharam D Collaborative energy conservation in a microgrid Proceedings of the 1st ACM Conference on Embedded Systems for Energy-Efficient Buildings, (130-139)
  133. Hou P, Yeoh W and Varakantham P Revisiting risk-sensitive MDPs Proceedings of the Twenty-Fourth International Conferenc on International Conference on Automated Planning and Scheduling, (136-144)
  134. ACM
    Freire A, Macdonald C, Tonellotto N, Ounis I and Cacheda F A self-adapting latency/power tradeoff model for replicated search engines Proceedings of the 7th ACM international conference on Web search and data mining, (13-22)
  135. ACM
    Al-Dujaily R, Dahir N, Mak T, Xia F and Yakovlev A (2013). Dynamic programming-based runtime thermal management (DPRTM), ACM Transactions on Design Automation of Electronic Systems, 19:1, (1-27), Online publication date: 1-Dec-2013.
  136. Goodson J, Ohlmann J and Thomas B (2013). Rollout Policies for Dynamic Solutions to the Multivehicle Routing Problem with Stochastic Demand and Duration Limits, Operations Research, 61:1, (138-154), Online publication date: 1-Jan-2013.
  137. Alizamir S, de Véricourt F and Sun P (2013). Diagnostic Accuracy Under Congestion, Management Science, 59:1, (157-171), Online publication date: 1-Jan-2013.
  138. Sopasakis P and Sarimveis H (2012). An integer programming approach for optimal drug dose computation, Computer Methods and Programs in Biomedicine, 108:3, (1022-1035), Online publication date: 1-Dec-2012.
  139. Cooper W and Rangarajan B (2012). Performance Guarantees for Empirical Markov Decision Processes with Applications to Multiperiod Inventory Models, Operations Research, 60:5, (1267-1281), Online publication date: 1-Sep-2012.
  140. Wu O, Wang D and Qin Z (2012). Seasonal Energy Storage Operations with Limited Flexibility, Manufacturing & Service Operations Management, 14:3, (455-471), Online publication date: 1-Jul-2012.
  141. Caro F and Martínez-de-Albéniz V (2012). Product and Price Competition with Satiation Effects, Management Science, 58:7, (1357-1373), Online publication date: 1-Jul-2012.
  142. Paté-Cornell M (2012). Games, Risks, and Analytics, Decision Analysis, 9:2, (186-203), Online publication date: 1-Jun-2012.
  143. ACM
    Narayanaswamy B, Garg V and Jayram T Online optimization for the smart (micro) grid Proceedings of the 3rd International Conference on Future Energy Systems: Where Energy, Computing and Communication Meet, (1-10)
  144. ACM
    Karumbu P, Prasanthi V and Kumar A (2012). Delay optimal event detection on ad hoc wireless sensor networks, ACM Transactions on Sensor Networks, 8:2, (1-35), Online publication date: 1-Mar-2012.
  145. KC D and Terwiesch C (2012). An Econometric Analysis of Patient Flows in the Cardiac Intensive Care Unit, Manufacturing & Service Operations Management, 14:1, (50-65), Online publication date: 1-Jan-2012.
  146. Furmston T and Barber D Lagrange dual decomposition for finite horizon Markov decision processes Proceedings of the 2011 European conference on Machine learning and knowledge discovery in databases - Volume Part I, (487-502)
  147. Furmston T and Barber D Lagrange dual decomposition for finite horizon Markov Decision Processes Proceedings of the 2011th European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part I, (487-502)
  148. Lai G, Wang M, Kekre S, Scheller-Wolf A and Secomandi N (2011). Valuation of Storage at a Liquefied Natural Gas Terminal, Operations Research, 59:3, (602-616), Online publication date: 1-May-2011.
  149. ACM
    Tsourakakis C, Peng R, Tsiarli M, Miller G and Schwartz R (2011). Approximation algorithms for speeding up dynamic programming and denoising aCGH data, ACM Journal of Experimental Algorithmics, 16, (1.1-1.27), Online publication date: 1-May-2011.
  150. ACM
    Cámara J, Girard A and Gössler G Synthesis of switching controllers using approximately bisimilar multiscale abstractions Proceedings of the 14th international conference on Hybrid systems: computation and control, (191-200)
  151. Miller G, Peng R, Schwartz R and Tsourakakis C Approximate dynamic programming using halfspace queries and multiscale Monge decomposition Proceedings of the twenty-second annual ACM-SIAM symposium on Discrete algorithms, (1675-1682)
  152. Xu Y, Bisi A and Dada M (2010). New structural properties of (s,S) policies for inventory models with lost sales, Operations Research Letters, 38:5, (441-449), Online publication date: 1-Sep-2010.
  153. Kim S and Giannakis G (2010). Sequential and cooperative sensing for multi-channel cognitive radios, IEEE Transactions on Signal Processing, 58:8, (4239-4253), Online publication date: 1-Aug-2010.
  154. Osais Y, Yu F and St-Hilaire M Thermal management of biosensor networks Proceedings of the 7th IEEE conference on Consumer communications and networking conference, (249-253)
  155. Ramírez-Hernández J and Fernandez E A simulation-based approximate dynamic programming approach for the control of the Intel Mini-Fab benchmark model Winter Simulation Conference, (1634-1645)
  156. Ji G and Liang B Stochastic rate control for scalable VBR video streaming over wireless networks Proceedings of the 28th IEEE conference on Global telecommunications, (5924-5929)
  157. Pahliani A, Spaan M and Lima P Decision-theoretic robot guidance for active cooperative perception Proceedings of the 2009 IEEE/RSJ international conference on Intelligent robots and systems, (4837-4842)
  158. Hong C and Tewfik A (2009). Heuristic Reusable Dynamic Programming, IEEE/ACM Transactions on Computational Biology and Bioinformatics, 6:4, (570-582), Online publication date: 1-Oct-2009.
  159. Aggarwal R, Schniter P and Koksal C (2009). Rate adaptation via link-layer feedback for goodput maximization over a time-varying channel, IEEE Transactions on Wireless Communications, 8:8, (4276-4285), Online publication date: 1-Aug-2009.
  160. Cowlagi R and Tsiotras P Shortest distance problems in graphs using history-dependent transition costs with application to kinodynamic path planning Proceedings of the 2009 conference on American Control Conference, (414-419)
  161. Ernst D, Glavic M, Capitanescu F and Wehenkel L (2009). Reinforcement learning versus model predictive control, IEEE Transactions on Systems, Man, and Cybernetics, Part B: Cybernetics, 39:2, (517-529), Online publication date: 1-Apr-2009.
  162. Kim S and Giannakis G (2009). Rate-optimal and reduced-complexity sequential sensing algorithms for cognitive OFDM radios, EURASIP Journal on Advances in Signal Processing, 2009, (1-11), Online publication date: 1-Mar-2009.
  163. ACM
    Yen L, Saerens M, Mantrach A and Shimbo M A family of dissimilarity measures between nodes generalizing both the shortest-path and the commute-time distances Proceedings of the 14th ACM SIGKDD international conference on Knowledge discovery and data mining, (785-793)
  164. Sintzoff M Synthesis of Optimal Control Policies for Some Infinite-State Transition Systems Proceedings of the 9th international conference on Mathematics of Program Construction, (336-359)
  165. Dahl G and Minken H (2008). Methods based on discrete optimization for finding road network rehabilitation strategies, Computers and Operations Research, 35:7, (2193-2208), Online publication date: 1-Jul-2008.
  166. Dua A and Bambos N (2007). Downlink Wireless Packet Scheduling with Deadlines, IEEE Transactions on Mobile Computing, 6:12, (1410-1425), Online publication date: 1-Dec-2007.
  167. ACM
    Jiang S, Xue Y and Schmidt D Disruption-aware service composition and recovery in dynamic networking environments Proceedings of the 2007 workshop on Automating service quality: Held at the International Conference on Automated Software Engineering (ASE), (28-33)
  168. ACM
    Dua A, Bambos N and Singh J Performance tradeoffs in mobile computing Proceedings of the 5th ACM international workshop on Mobility management and wireless access, (99-106)
  169. Tavazoei M and Haeri M (2007). An optimization algorithm based on chaotic behavior and fractal nature, Journal of Computational and Applied Mathematics, 206:2, (1070-1081), Online publication date: 20-Sep-2007.
  170. ACM
    Jureta I, Faulkner S, Achbany Y and Saerens M Dynamic task allocation within an open service-oriented MAS architecture Proceedings of the 6th international joint conference on Autonomous agents and multiagent systems, (1-3)
  171. Tarraf D, Megretski A and Dahleh M Finite state controllers for stabilizing switched systems with binary sensors Proceedings of the 10th international conference on Hybrid systems: computation and control, (543-556)
  172. ACM
    Neglia G and Zhang X Optimal delay-power tradeoff in sparse delay tolerant networks Proceedings of the 2006 SIGCOMM workshop on Challenged networks, (237-244)
  173. Cooper W, Homem-de-Mello T and Kleywegt A (2006). Models of the Spiral-Down Effect in Revenue Management, Operations Research, 54:5, (968-987), Online publication date: 1-Sep-2006.
  174. ACM
    Tang J and Zhang X Cross-layer-model based adaptive resource allocation for statistical QoS guarantees in mobile wireless networks Proceedings of the 3rd international conference on Quality of service in heterogeneous wired/wireless networks, (44-es)
  175. Roy S, Herlugson K and Saberi A (2006). A Control-Theoretic Approach to Distributed Discrete-Valued Decision-Making in Networks of Sensing Agents, IEEE Transactions on Mobile Computing, 5:8, (945-957), Online publication date: 1-Aug-2006.
  176. Song H, Yang U, Lee S and Sohn K 3D face recognition based on facial shape indexes with dynamic programming Proceedings of the 2006 international conference on Advances in Biometrics, (99-105)
  177. Tarello A, Sun J, Zafer M and Modiano E Minimum Energy Transmission Scheduling Subject to Deadline Constraints Proceedings of the Third International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks, (67-76)
  178. England D and Weissman J A Stochastic Control Model for Deployment of Dynamic Grid Services Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing, (192-199)
  179. Mannor S, Rubinstein R and Gat Y The cross entropy method for fast policy search Proceedings of the Twentieth International Conference on International Conference on Machine Learning, (512-519)
  180. ACM
    Arikan O, Forsyth D and O'Brien J Motion synthesis from annotations ACM SIGGRAPH 2003 Papers, (402-408)
  181. ACM
    Arikan O, Forsyth D and O'Brien J (2003). Motion synthesis from annotations, ACM Transactions on Graphics, 22:3, (402-408), Online publication date: 1-Jul-2003.
  182. Nikovski D and Brand M Decision-theoretic group elevator scheduling Proceedings of the Thirteenth International Conference on International Conference on Automated Planning and Scheduling, (133-142)
  183. Nikovski D and Brand M Marginalizing out future passengers in group elevator control Proceedings of the Nineteenth conference on Uncertainty in Artificial Intelligence, (443-450)
  184. Schubert K and Bambos N Data aggregation for low power wireless devices MILCOM 2016 - 2016 IEEE Military Communications Conference, (97-102)
  185. Guan T and Frey C Unified predictive fuel efficiency optimization using traffic light sequence information 2016 IEEE Intelligent Vehicles Symposium (IV), (1103-1108)
  186. Ma K, Liu L and Sukhatme G An information-driven and disturbance-aware planning method for long-term ocean monitoring 2016 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), (2102-2108)
  187. Guan T and Frey C Predictive energy efficiency optimization of an electric vehicle using traffic light sequence information* 2016 IEEE International Conference on Vehicular Electronics and Safety (ICVES), (1-6)
  188. Andersson O, Wzorek M, Rudol P and Doherty P Model-predictive control with stochastic collision avoidance using Bayesian policy optimization 2016 IEEE International Conference on Robotics and Automation (ICRA), (4597-4604)
Contributors
  • School of Computing and Augmented Intelligence

Recommendations