用户名: 密码: 验证码:
近似动态规划方法及其在交通中的应用
详细信息    本馆镜像全文|  推荐本文 |  |   获取CNKI官网全文
摘要
论文研究了近似动态规划方法及其在交通中的应用,丰富和完善了近似动态规划理论。论文主要研究内容和创新点总结如下:
     一、提出了一种近似动态规划网络优化加速算法。在权值调整期间加入前期权值信息,增强了训练的稳定性;使用Steffensen迭代算法进行加速,使网络训练较快地收敛,有效地解决了传统神经网络收敛慢的缺点,在此基础上,给出了一种基于数值计算的近似动态规划改进方法ADHDP(D),仿真结果表明该方法误差稳步下降,没有出现ADHDP方法中振荡的现象,且达到收敛稳定的速度更快。此外,提出了一种权值初始值复合修正法,仿真结果表明,与权值初始值随机设定方法相比,该方法提高了近似动态规划方法的学习成功率;
     二、研究了快速路交通流模型参数辨识方法。针对交通流模型的强非线性、不确定性等特点,提出了基于近似动态规划的交通流模型参数辨识算法。该算法具有自学习和自适应的特性,不依赖于被控对象的解析模型,严格的理论推导证明了这种参数辨识方案的收敛性,仿真结果验证了该算法的有效性;
     三、研究了快速路短时交通流预测方法。针对指数平滑法缺乏有效的参数选取方法,提出了一种基于近似动态规划方法的自适应单指数平滑法,结合实际交通流数据对指数平滑系数进行优化,使其随预测过程自动更新,从而保证了预测的实时性、准确性。严格的理论推导证明了这种预测方法的收敛性,仿真结果验证了算法的有效性;
     四、研究了城市交叉口均衡控制的最优信号配时问题。针对过饱和交叉口提出了排队长度均衡的控制目标,设计了基于排队长度均衡的两相位和三相位绿灯时间近似动态规划控制算法。仿真结果表明基于近似动态规划的控制算法可以根据实时交通车流信息实现绿灯时间的自适应调整,克服了定时控制不能随着流量的变化而分配绿灯时间的缺点。此外,以三相位交叉口最优信号配时为例,讨论了神经网络权值初始值对算法收敛结果的影响;
     五、研究了快速路系统中单入口匝道以及多入口匝道基于近似动态规划的控制算法。针对交通流强非线性、不确定性等特点,设计了基于近似动态规化的控制器,避开了交通流建模难的问题。仿真结果表明控制器具有良好的暂态性能,能够适应实时变化的交通状况,平滑交通流,在一定程度上缓解了交通拥堵。
This dissertation focused on some issues on approximate dynamic programming (ADP) and its applications in transportation. The main work and key contributions were summarized as the following:
     1. Based on Steffensen's method and former weights'values, a new optimized accelerated algorithm was presented for neural network. On the basis of this new optimized accelerated algorithm, a new ADHDP(Action-Dependent Heuristic Dynamic Programming) method based on former weights'value data (ADHDP(D) for short) was proposed. A detailed analysis was made between the ADHDP and ADHDP(D). Simulation results showed the good performance of this new algorithm. Furthermore, an initial weights compositional method was proposed. Simulation results demonstrated the convergence property is improved effectively in comparison with initial weights randomly preassigned.
     2. Considering on that traffic system is a strong nonlinear and uncertain system, an identification method based on ADP was developed to estimate the parameters of the general discrete-time nonlinear traffic flow system. With rigorous analysis, it was shown that the proposed identification scheme, independent of the precise traffic flow model, can guarantee the convergence. A number of simulation results were provided to the efficacy of the proposed approach.
     3. An adaptive single-exponent smoothing based on ADP was put forward to select the smoothing coefficient dynamically. With rigorous analysis, it was shown that the proposed predictionscheme can guarantee the convergence. The simulation results verifid the effectiveness of the proposed algorithm.
     4. The optimal signal timing problem was investigated for an urban intersection.Based on the technique ADP, the optimal signal timing controllers were proposed for the two-phase intersection and three-phase intersection, respetively. The simulation examples showed that the control algorithm based on ADP can allocate green time rationally and achieve the equilibrium of queue length. Furthermore, taking the optimal signal timing of three-phase intersection for example, simulations verifid that the initial weights affect the convergence result.
     5. Based on the technique of ADP, the problems of local ramp and coordinated ramp metering were discussed. The ADP controller was designed to avoid the difficulty of traffic flow modeling. Simulation results demonstrated the new controllers have better transient response, prevent congestion and increase traffic throughput.
引文
[1]D. P. Bertsekas. Dynamic Programming: Deterministic and Stochastic Models. Prentice-Hall, Inc. Upper Saddle River, NJ, USA,1987.
    [2]B. Widrow, N. Gupta, and S. Maitra. Punish/reward: Learning with a critic in adaptive threshold systems. IEEE Transactions on Systems, Man, Cybernetics,1973,3(5):455-465.
    [3]C. Y. Seong, B. Widrow. Neural Dynamic Optimization for Control Systems. IEEE Transactions on Systems, Man, Cybernetics,2001,31(4):482-489.
    [4]P. J. Werbos. Consisteney of HDP applied to a simple reinforcement learning problem. Neural Networks,1990,3(22):179-189.
    [5]R. E. Saeks, C. J. Cox, J. C. Neidhoefer, et al. Neural adaptive control of LoFLYTE. Proceedings of American Control Conference,2001,2913-2917.
    [6]D. R. Liu, H. G. Zhang. A Neural Dynamic Programming Approach for Learning Control of Failure Avoidance Problems. International Journal of intelligent Contorl and Systems,2005, 10(1):21-32.
    [7]A. G. Barto, R. S. Sutton, C. W. Anderson. Neuronlike adaptive Elements that Can Solve Diffieult Learning Control Problems. IEEE Transactions on Systems, Man and Cybernetics, 1983,13(5):835-846.
    [8]C. C. Watkins, P. Dayan. Q-learning. Machine Learning,1992,8:279-292.
    [9]P. J. Werbos. Back propagation through time:What it is and how to do it. Proceedings of the IEEE,1990,78(10):1550-1560.
    [10]赵冬斌,刘德荣,易建强.基于自适应动态规划的城市交通信号优化控制方法综述,自动化学报,2009,35(6):676-680.
    [11]C. Chen, K. W. Chi, G. H. Benjamin. Adaptive traffic signal control using approximate dynamic programming. Transportation Research Part C,2009,17:456-474.
    [12]A. T. Asma, L. L. Frank, A. K. Murad. Model-free Q-learning designs for linear discrete-time zero-sum games with application to H-infinity control. Automatica,2007,43:473-481.
    [13]J. M. Lee, J. H. Lee. Approximate dynamic programming-based approaches for input-output data-driven control of nonlinear processes. Automatica,2005,41:1281-1288.
    [14]L. Buoniu, D. Ernst, B. D. Schutter, R. Babukaa. Approximate dynamic programming with a fuzzy parameterization. Automatica,2010,46:804-814.
    [15]F. L. Lewis, D. Vrabie. Reinforcement Learning and Adaptive Dynamic Programming for feedback control. IEEE Circuits and Systems Magazine,2009,32-50.
    [16]J. H. Lee, J. M. Lee. Approximate dynamic programming based approach to process control and scheduling. Computers& Chemical Engineering,2006,30:1603-1618.
    [17]J. H. Lee, W. Wong. Approximate dynamic programming approach for process control. Journal of Process Control,2010,20:1038-1048.
    [18]Q. L. Wei, H. G. Zhang, J. Dai. Model-free multiobjective approximate dynamic programming for discrete-time nonlinear systems with general performance index functions. Neurocomputing,2009,72:1839-1848.
    [19]J. M. Lee, N. S. Kaisare, J. H. Lee. Choice of approximator and design of penalty function for an approximate dynamic programming based control approach. Journal of Process Control, 2006,16:135-156.
    [20]D. R. Liu, X. X. Xiong, Y. Zhang. Action-dependent adaptive critic designs. Proceedings of International Joint Conference on Neural Networks,2001,2:15-19.
    [21]D. Liu, H. Javaherian, O. Kovalenko, T. Huang. Adaptive Critic Learning Techniques for Engine Torque and Air-Fuel Ratio Control. IEEE Transactions on Systems, Man and Cybernetics,2008,38:988-993.
    [22]D. R. Liu. Approximate Dynamic Programming for Self-Learning Control. ACTA Automatica Sinica,2005,31(1):13-17.
    [23]D. Liu, H. D. Patino. Adaptive critic designs for self-learning ship steering control. Intelligent Control/Intelligent Systems and Semiotics. Proceedings of IEEE International Symposium, 1999,46-51.
    [24]V. Danil, C. Donald, II. Wunsch. Adaptive Critic Designs. IEEE Transactions on Neural Networks,1997,8(5):997-1007.
    [25]J. Ma, T. Yang, Z.G. Hou, M. Tan, D.R Liu. Dual Heuristic Programming Based Neurocontroller for Vibration Isolation Control. Proceedings of IEEE International Conference on Networking, Sensing and Control,2008,874-879.
    [26]H. Javaherian, D. R. Liu, O. Kovalenko. Automotive Engine Torque and Air-Fuel Ratio Control Using Dual Heuristic Dynamic Programming. Proceedings of International Joint Conference on Neural Networks,2006,518-525.
    [27]O. Kovalenko, D. R. Liu, H. Javaherian. Neural network modeling and adaptive critic control of automotive fuel-injection systems. Proceedings of IEEE International Symposium on Intelligent Control,2004,368-373.
    [28]H. Javaherian, D. R. Liu, Y. Zhang, O. Kovalenko. Adaptive critic learning techniques for automotive engine control. Proceedings of American Control Conference,2004,5:4066-4071.
    [29]D. R. Liu, Y. Zhang. A self-learning adaptive critic approach for call admission control in wireless cellular networks. Proceedings of IEEE International Conference on Communications, 2003,3:1853-1857.
    [30]C. Cai. An Approximate Dynamic Programming Strategy for Responsive Traffic Signal Control. Proceedings of IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning,2007,303-310.
    [31]R. Hernandez, E. Fernandez. An Approximate Dynamic Programming Approach for Job Releasing and Sequencing in a Reentrant Manufacturing Line. Proceeding of IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning,2007,201-208.
    [32]R. Padhi, S. N. Balakrishnan. Approximate dynamic programming based optimal neurocontrol synthesis of a chemical reactor process using proper orthogonal decomposition. Proceedings of the International Joint Conference on Neural Networks,2003,3:1891-1896.
    [33]S. Shervais, T. T. Shannon. Improving theoretically-optimal and quasi-optimal inventory and transportation policies using adaptive critic based approximate dynamic programming. Proceedings of International Joint Conference on Neural Networks,2001,2:1008-1013.
    [34]J. Murray, C. Cox, R. Saeks, G. Lendaris. Globally convergent approximate dynamic programming applied to an autolander. Proceedings of American Control Conference,2001,4: 2901-2906.
    [35]S. Ferrari, J. E. Steck, R. Chandramohan.Adaptive Feedback Control by Constrained Approximate Dynamic Programming. IEEE Transactions on Systems, Man and Cybernetics, 2008,38(4):982-987.
    [36]J. Seiffertt, S. Sanyal, D. C. Wunsch. Hamilton-Jacobi-Bellman Equations and Approximate Dynamic Programming on Time Scales. IEEE Transactions on Systems, Man and Cybernetics, 2008,38(4):918-923.
    [37]C. V. Peroni, N. S. Kaisare, J. H. Lee. Optimal control of a fed-batch bioreactor using simulation-based approximate dynamic programming. IEEE Transactions on Control Systems Technology,2005,13(5):786-790.
    [38]B. Bethke, A. Ozdaglar. Approximate dynamic programming using support vector regression. Proceedings of 47th IEEE Conference on Decision and Control,2008,3811-3816.
    [39]W. B. Powell. Approximate dynamic programming:Lessons from the field. Proceedings of Simulation Conference on winter simulation Conference,2008,205-214.
    [40]R. E. Saeks, C. J. Cox. K. Mathia. A. J. Maren. Asymptotic dynamic programming: preliminary concepts and results. Proceedings of International Conference on Neural Networks,1997,4:2273-2278.
    [41]M. Baglietto, G. Battistelli, F. Vitali. Shortest path problems on stochastic graphs:a neuro dynamic programming approach. Proceedings of the 42nd IEEE Conference on Decision and Control 2003,6:6187-6193.
    [42]P. Marbach, J. N. Tsitsiklis. A neuro-dynamic programming approach to admission control in ATM networks:the single link case. Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing,1997,1:159-162.
    [43]J. N. Tsitsiklis, B. V. Roy, Neuro-dynamic programming overview and a case study in optimal stopping. Proceedings of the 36th IEEE Conference on Decision and Control,1997,2: 1181-1186.
    [44]B. V. Roy, D.P.Bertsekas, Y. Lee, J.N.Tsitsiklis. A neuro-dynamic programming approach to retailer inventory management. Proceedings of the 36th IEEE Conference on Decision and Control,1997,4:4052-4057.
    [45]J. Si, Y. T. Wang. Neuro-dynamic programming based on self-organized patterns. Proceedings of IEEE International Symposium on Intelligent Control/Intelligent Systems and Semiotics, 1999,120-125.
    [46]D. P. Bertsekas, J. N. Tsitsiklis. Neuro-dynamic programming:an overview. Proceedings of the 34th IEEE Conference on Decision and Control,1995,1:560-564.
    [47]H. Patino, J. Pucheta, R. Fullana, C. Schugurensky, B. Kuchen. Neuro-dynamic programming-based optimal control for crop growth in precision agriculture. Proceedings of IEEE International Symposium on Intelligent Control,2004,397-402.
    [48]Y. Lei, J. Si. Optimal etch time control design using neuro-dynamic programming. Proceedings of IEEE International Semiconductor Manufacturing Symposium,2001,75 - 78.
    [49]J. J. Murray, C. J. Cox, G. G. Lendaris, and R. E. Saeks. Adaptive dynamic programming. IEEE Transactions on Systems, man, and Cybernetics,2002,32(5):140-153.
    [50]J. J. Murray, C. J. Cox, R.E.Saeks. The adaptive dynamic programming theorem In Stability and Control of Dynamical Systems with Application, Boston, MA:Birkhauser,2003.
    [51]D.Vrabie, M.A.Khalaf, F.L.Lewis, Y.Y. Wang. Continuous-Time ADP for Linear Systems with Partially Unknown Dynamics. Proceedings of IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning,2007,247-253
    [52]T. T. Shannon, R. A. Santiago, G. L. George. Accelerating critic learning in approximate dynamic programming via value templates and perceptual learning. Proceedings of International Joint Conference on Neural Networks,2003,4:2922-2927.
    [53]J. Xu, F. M. Liang, W. S. Yu. Learning with Eligibility Traces in Adaptive Critic Designs. Proceedings of IEEE International Conference on Vehicular Electronics and Safety,2006, 309-313.
    [54]G. K. Venayagamoorthy, R. G. Harley and D. G. Wunsch. Comparison of heuristic dynamic programming and dual heuristic programming adaptive critics for neurocontrol orturbogenerator. IEEE Transactions on Neural Networks,2002,13(5):764-773.
    [55]S. Mohagheghi, G. K. Venayagamoorthy, R. G. Harley. Adaptive Critic Design Based Neuro-Fuzzy Controller for a Static Compensator in a Multimachine Power System. IEEE Transactions on Power Systems,21(4):1744-1754.
    [56]S. N. Balakrishnan, V. Biega. Adaptive-critic-based neural networks for aircraft optimal control. Journal of Guidance. Control, Dynamics,1996,19(4):893-898.
    [57]J. Dalton and S. N. Balakrishnan. A neighboring optimal adaptive critic for missile guidance. Mathematical and Computer Modeling,1996,23(1):175-188.
    [58]R. Enns, S. Jennie. Helicopter trimming and tracking control using direct neural dynamic programming. IEEE Transactions on Neural Networks.2003,14(4):929-939.
    [59]Y. Zhang and D. Liu. Call admission control for cdma cellular networks using adaptive critic designs. Proceedings of the 18th IEEE International Symposium on Intelligent Control,2003, 9:511-516.
    [60]A. Pietrabissa, D. Anticoli. An approximate dynamic programming approach to admission control in WCDMA networks. Computer Aided Control System Design, Proceedings of the IEEE International Symposium on Intelligent Control,2006,2087-2092.
    [61]Z. Sun, X. Chen, Z. H. He. Adaptive Critic Design for Energy Minimization of Portable Video Communication Devices. Proceedings of the 17th International Conference on Computer Communications and Networks,2008,1-5.
    [62]H. Javaherian, D. Liu, Y. Zhang, and O. Kovalenko. Adaptive critic learning techniques for automotive engine control. Proceeding of American Control Conference,2004,6:4066-4071.
    [63]X. F. Lin, L. Yu, S. J. Song. The Application of Action-Dependent Adaptive Critic Designs in Boiler Combustion Control. Proceeding of International Conference on Intelligent Computation Technology and Automation,2008,2:39-43.
    [64]J. J. Govindhasamy, S. F. McLoone, G.W.Irwin. Sequential Learning for Adaptive Critic Design:An Industrial Control Application. Proceeding of IEEE Workshop on Machine Learning for Signal Processing,2005,265-270.
    [65]R. Zaman, D. Prokhorov, D.C. Wunsch. Adaptive critic design in learning to play game of Go. Proceedings of International Conference on Neural Networks,1997,1:1-4.
    [66]R. L. Welch, G. K. Venayagamoorthy. Comparison of Two Optimal Control Strategies for a Grid Independent Photovoltaic System. Proceedings of IEEE International Conference on Industry Applications Conference,2006,3:1120-1127.
    [67]R. L. Welch, G. K. Venayagamoorthy. HDP based optimal control of a grid independent PV system. Proceedings of IEEE Power Engineering Society General Meeting,2006,6-9.
    [68]W. M. Li, G. Q. Xu, Z. C. Wang, Yangsheng Xu. Dynamic energy management for hybrid electric vehicle based on approximate dynamic programming. Proceedings of the 7th World Congress on Intelligent Control and Automation,2008,7864-7869.
    [69]G. G. Lendaris, T. T. Shannon, L. J. Schultz, S.Hutsell, A.Rogers. Dual heuristic programming for fuzzy control. Proceedings of IFSA World Congress and 20th NAFIPS International Conference,2001,1:551-556.
    [70]W. S. Lin, P. C. Yang. DHP Adaptive Critic Motion Control of Autonomous Wheeled Mobile Robot. Proceedings of IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning,2007,311-317.
    [71]G. G. Lendaris, L. Schultz, T. Shannon. Adaptive critic design for intelligent steering and speed control of a 2-axle vehicle. Proceedings of IEEE-INNS-ENNS International Joint Conference on Neural Networks,2000,3:73-78.
    [72]D. R. Liu. An adaptive critic approach for self-learning stock trading. Proceedings of IEEE/IAFE 1999 Conference on Computational Intelligence for Financial Engineering,1999, 271-280.
    [73]F. Y. Wang, H. G. Zhang, D.R. Liu. Adaptive Dynamic Programming:An Introduction, IEEE computational intelligence magazine,2009,39-47.
    [74]程玉虎,王雪松,孙伟.一种自适应强化学习算法在状态空间构建中的应用,系统仿真学报,2006,18(1):188-191.
    [75]J. Xu, W. S. Yu, F. Y. Wang. Ramp metering based on adaptive critic designs. Proceedings of the IEEE Intelligent Transportation Systems Conference,2006,1531-1536.
    [76]许静.基于神经网络最优化的交通控制技术.博士学位论文,中国科学院自动化所研究院.2007.
    [77]X. R. Bai, D. B. Zhao, J. Q. Yi, J. Xu. Ramp metering based on on-line ADHDP (λ) controller. Proceedings of IEEE International Joint Conference on Neural Networks,2008,1847-1852.
    [78]T. Li, D. B Zhao, J. Q. Yi. Adaptive Dynamic Programming for Multi-intersections Traffic Signal Intelligent Control. Proceedings of the 11th International IEEE Conference on Intelligent Transportation Systems,2008,286-291.
    [79]T. Li, D. B. Zhao, J. Q. Yi. Adaptive dynamic neuro-fuzzy system for traffic signal control. Proceedings of IEEE International Joint Conference on Neural Networks,2008,1840 - 1846.
    [80]L. A. Zadeh, Fuzzy sets, Information and Control,1965,8 (3),338-353.
    [81]L. A. Zadeh, Outline of a new approach to the analysis of complex systems and decision processes, IEEE Transactions on Systems, Man and Cybernetics,1973,3(1),28-44.
    [82]E. Mamdani, and S. Assilian, An experiment in linguistic synthesis with a fuzzy logic controller, International Journal of Man-Machine Studies,1975,7(1),1-13.
    [83]E. Mamdani, Advances in the linguistic synthesis of fuzzy controllers, International Journal of Man-Machine Studies,1976,8 (6),669-678.
    [84]J. H. Holland, Adaptation in natural and artificial systems, The MIT Press,1992.
    [85]W. S. McCulloch, and W. H. Pitts, A logical calculus of the ideas immanent in nervous activity, Bulletin of Mathematical Biophysics,1943,5,115-133.
    [86]M. Minsky, and S. Papert, Perceptrons:an introduction to computational geometry, The MIT Press,1969.
    [87]J. J. Hopfield, Neural networks and physical systems with emergent collective computational abilities, Proceedings of the National Academy of Sciences of the USA,1982,79 (8), 2554-2558.
    [88]侯忠生.非线性系统参数辨识、自适应控制及无模型学习自适应控制,沈阳:东北大学博士论文,1994.
    [89]侯忠生.无模型学习自适应控制理论,哈尔滨:哈尔滨工业大学博士后科学研究报告,1997.
    [90]侯忠生.非参数模型及其自适应控制理论,北京:科学出版社,1999.
    [91]侯忠生.无模型自适应控制的现状和展望,控制理论与应用,2006,23(4):586-592.
    [92]J. X. Xu, and Y. Tan. Linear and nonlinear iterative learning control. Springer-Verlag Berlin Heidelberg,2003.
    [93]贺国光,李宇,马寿峰.基于数学模型的短时交通流预测方法探讨.系统工程理论与实践,2000(12):51-56.
    [94]B. L. Smith, M. J. Dermetsky. Traffic flow forecasting:comparison of modeling approaches, Journal of Transportation Engineering,1997,123(4):261-266.
    [95]M.V.D. Voort, M. Dougherty, S. Watson. Combining kohonen maps with ARIMA time series models to forecast traffic flow, Transportation Research C,1996,4(5):307-318.
    [96]I. Okutani, Y.J. Stephamedes. Dynamic prediction of traftic volume through kalman filtering theory. Transportation Research B,1984,18B(1):1-11.
    [97]杨兆升,朱中.卡尔曼滤波理论的交通流实时预测模型.中国公路学报,2003,12(3):63-67.
    [98]M. S. Doughertty, M. R. Cobbett. Short-term inter-urban traffic forecasting using neural networks, International Journal of forecasting,1997,13:21-31.
    [99]B. Park, C.J. Messer, T. Urbanik. Short term freeway traffic volumeforecasting using radial basis function neural networks, WashingtonD C:National Academy Press,1998,39-47.
    [100]J.W.C. Van, S. P. Hoogendoom. Freeway Travel Time Prediction with State space Neural Networks:Modeling State space Dynamics with Recurrent Neural Networks. Washington D C: National Academy Press,2002:30-39.
    [101]S. Ishak, C. Aleesandru. Optimizing Traffic Prediction Performance of Neural Networks under Various Topological, Input, and Traffic Condition Setting, Journal of Transportation Engineering,2004,130(7):452-465.
    [102]B. Monica. Learning without Local Minimain Radial Basis Function Networks, IEEE Trans on Neural Networks,1995.6(3):749-755.
    [103]R. B. Allsop. SIGSET: A computer program for calculating traffic capacity of signal-controlled road junctions. Traffic Engineering Control,1971,12:58-60.
    [104]R. B. Allsop. SIGCAP: A computer program for assessing the traffic capacity of signal-controlled road junctions. Traffic Engineering Control,1976,17:338-341.
    [105]F. V. Webster. Traffic signal settings. Road Research Laboratory, London, U.K., Road Research Technical Paper 39,1958.
    [106]C. P. Pappis, E. H. Mamdani. A fuzzy logic controller for a traffic junction. IEEE Transactions on Systems, Man, and Cybernetics,1977,7:707-717.
    [107]B. T. Mohamed, S. K. Mohamed, and A.Murali. A two-stage fuzzy logic controller for traffic signals. Transportation Research Part C,1999,7:353-367.
    [108]J. J. Henry, J. L. Farges, J. L. Galleg. Neuro-fuzzy techniques for traffic control. Control Engineering Practice,1998,6:755-761.
    [109]E. Bingham. Reinforement learning in neurofuzzy traffic signal control. European Journal of Operation Research,2001,131:232-241.
    [110]李秀平,刘智勇等.平面交叉路口的神经网络自学习控制方案.信息与控制,2001,30(1):76-79.
    [111]T. L. Thorpe. Vehicle traffic light control using SARSA. Master's Projcet Report, Computer Science Department, Colorado State University, Colo,1997.
    [112]B. Abdulhai, R. Pringle, G. J. Karakoulas. Reinforement learning for true adaptive traffic signal control. ASCE Journal of Transportation Engineering,2003,129(3):278-285.
    [113]A. Martin and W. Brauer. Fuzzy model-based reinforcement learning. Proceedings of the European Symposium on Intelligent Technique,2000,14-15.
    [114]J. D. C.Little. The synchronizationof traffic signals by mixed-integer-linear programming. Operation Research,1966,568-594.
    [115]J. D. C. Little, M. D.Kelson, N. H. Gartner. MAXBAND: A program for setting signals on arteries and triangular networks. Transportation Research Record,1981,795:40-46.
    [116]N. H. Gartner. Road traffic control: Progression methods. Concise Encyclopedia of traffic and Transportation Systems, Pergamon Press, Oxford, UK,1911,391-396.
    [117]N. H. Gartner, S. F. Assmann, F.Lasaga and D.L.Hom. A multiband approach to arterial traffic signal optimization. Transportation Research PartB,1991,25(1):55-74.
    [118]李灵犀,高海军,陈龙.两相邻路口信号的分层递阶模糊控制.中国公路学报,2002,15(4):66-68.
    [119]李灵犀,高海军.相邻路口信号的协调控制.自动化学报,2003,29(6):947-952.
    [120]T. H. Heung, T. K. Ho, and Y.F.Fung. Coordinated road-junction traffic control by dynamic programming. IEEE Transactions on Intelligent Transportation Systems,2005,6(3):341-350.
    [121]马寿峰,李英,刘豹.一种丛于agent协调的两路口交通控制方法.系统工程学报,2003,18(3):272-278.
    [122]D. I. Roberson. TRANSYT method for area trafficcontrol. Traffic Engineering and Control, 1969,10:276-281.
    [123]M. T. Li, A.C. Gan. Signal timing optimization for oversaturated networks using TRANSYT-7F. Transportation Research Record,1999,1638:118-126.
    [124]P. B. Hunt, D. L. Robertson, R. D. Bretherton. The SCOOT on-line traffic signal optimization technique. Traffic Engineering and Control,1982,23:190-192.
    [125]P. Lowrie. The Sydneycoordinated adaptive traffic system:Principles, Methodology, Alogrithms. Proceedings of International Conference on Road Traffic Siganlling,1982,67-70.
    [126]A. G. Sims and A. B. Finlay. SCATS, splitsand offsets simplified (S.O.S.). Proceedings of the ARRB,1984,12-14.
    [127]O. P. Dell and P. B. Mirchandani. REALBAND:An approach for real-time coordination of traffic flow on networks. Transportation Research Record,1995,1494:106-116.
    [128]K. L. Head. Event-based short-term traffic flowprediction model. Transportation Research Record,1995,1510:45-52.
    [129]P. B. Mirchandani, A. Knyazyan, K. L. Head, and W. J. Wu. An approach towards the intergration of bus priorityand traffic adaptivesignal control. Proceedings of International conference on computer-aided scheduling of public transport,2001,505:319-334.
    [130]P. B. Mirchandani and K. L. Head. A real-time traffic signal control system:Architecture Algorithm and Analysis. Transportation Research Part C,2001,9:415-432.
    [131]P. Varaiya. Reducing highway congestion:an empirical approach. Special Issue:Fundamental Issue in Control, European Journal of Control,2005,11(4-5):301-309.
    [132]M. Parageorgiou, J. M. Blosseville, and H. H. Salem. Modeling and real time control on traffic flow on the southern part of Boulevard Peripherique in Paris. Part I:Modeling; Part II: Coordinated on-ramp metering. Transportation Research Part A,1990,24:345-370.
    [133]G. Paesani, J. Kerr, P. Perovich et al. System wide adaptive ramp metering in southern California. ITS America 7th Annual Meeting,1997.
    [134]C. Chen, Z. F. Jia, P. Baraiya. Causes and Cures of Highway congestion. IEEE Control Systems Magazine,2001,21(6):26-32.
    [135]M. Papageorgiou. ALINEA:A local feedback control law for on-ramp metering. Transportation Research Record,1991,1320:58-64.
    [136]E. Smaragdis, M.Papageorgiou A series of new local ramp metering strategies. Transportation Research Record,2003,1856:74-86.
    [137]C. Taylor, D. Meldrum, L. Jacobson. Fuzzy ramp metering-design overview and simulation results. Transportation Research Record,1998,1634:10-18.
    [138]H. M. Zhang Freeway ramp metering using artificial neural network. Transportation Research Part C,1997,5(5):273-286.
    [139]Y. Stephanedes, K. Chang Optimal control of freeway corridors. ASCE Journal of Transportation Engineering,1993,119(4):504-114.
    [140]A. Kotsialos, M. Papageorgiou, M. Mangeas, et al. Coordinated and integrated control of motorway networks via non-linear optimal control. Transportation Research Part C,2002, 10(1):65-84.
    [141]O. Chen, A. Hotz, M. B. Akiva Development and evaluation of a dynamic ramp metering control model. Proceedings of the 8th IFAC Symposium on Transportation Systems,1997, 1162-1167.
    [142]J. C. Liu, J. Kim, Y. Chen, et al. An advanced real time metering system:The system concept. Texas:Department of Transportation,1993.
    [143]M. Papageorgiou A hierarchical control system for freeway traffic. Transportation Research PartB,1983,17(3):251-261.
    [144]G. L.Chang, J. Wu, S. Coben. Integrated real-time ramp metering model for nonrecurrent congestion: Framework and preliminary results. Transportation Research Record,1994,1446: 56-65.
    [145]L. Lipp, L. Corcoran, G. Hickman Benefits of central computer control for the Denver ramp metering system. Transportation Research Board Record,1991,1320:3-6.
    [146]J. Banks. Effect of response limitations on traffic-responsive ramp metering. Transportation Research Board Record,1993,1394:17-25.
    [147]L. Jacobsen, K. Henry, O.Mahyar. Real-time metering algorithm for centralized control. Transportation Research Record,1989,1232:17-26.
    [148]R. L. Welch, G. K. Venayagamoorthy. Comparison of Two Optimal Control Strategies for a Grid Independent Photovoltaic System. Proceedings of IEEE International Conference on Industry Applications,2006,3:1120-1127.
    [149]K. W.Tang, G.Srikant. Reinforcement control via action dependent heuristic dynamic programming. Proceedings of IEEE International Conference on Neural Networks,1997,3 (3):1766-1770.
    [150]T.Hanselmann, A. Zaknich, L. Noakes, A. Savkin. A hybrid dynamical system with robust switching control by action dependent heuristic dynamic programming. Proceedings of IEEE International Joint Conference on Neural Networks,2004,3(3):1799-1804.
    [151]M. T. Hagan, H.B.Demut, M.Beale. Neural network design.北京:机械工业出版社(英文版), 2002.
    [152]Z. S. Hou, J. X. Xu, J.W.Yan. An Iterative Learning Approach for Density Control of Freeway Traffic Flow via Ramp Metering. Transportation Research Part C,2008,16:71-97
    [153]S. Kumar. Neural Networks. 清华大学出版社,2006
    [154]刘乐,王洪国等.一种有效脱离平坦区的改进BP网络,计算机科学,2008,35(7):95-99.
    [155]R. A. Jacobs Increased rates of convergence through learning rate adaptation. Neural Networks, 1988,1(4):295-308.
    [156]D. E. Rumelhart, G. E. Hinton, R. J. Williams. Learning representation by back-propagation errors. Nature,1986,323:533-536.
    [157]X. H. Yu, G. A. Chen, S. X. Cheng. Dynamic Learning Rate Optimization of the Backpropagation Algorithm. IEEE Transactions on Neural Networks,1995,6(3):669-677.
    [158]X. H. Yu, G. A. Chen. Efficient Backpropagation Learning Using Optimal Learning Rate and Momentum. Neural Networks,1997,10(3):517-527.
    [159]P. Chandra, Y. Singh. An activation function adapting training algorithm for sigmoidal feedforward networks. Neurocomputing,2006,61:429-437.
    [160]X. G. Wang, Z.Tang, H.Tamur, et al. An improved backpropagation algorithm to avoid the local minima problem. Neurocomputing,2004,56:455-460.
    [161]邓建中Steffensen迭代加速法的改进.西安交通大学学报,1993,27(3):99-103.
    [162]关治,陈景良.数值计算方法.北京:清华大学出版社,1990.
    [163]M. Parageorgiou, A. Kotsiaios. Freeway ramp metering:an overview. IEEE Transactions on Intelligent Transportation Systems,2002,3(4):271-281.
    [164]M. Cremer, M Papageorgiou. Parameter identification for a traffic flow model.Automatica, 1981,17(6):837-843.
    [165]M. S. Grewail, H. J. Payne. Identification of parameters in a freeway traffic model.IEEE Transactions on Systems,Man,and Cybernetics,1976,6(3):176-286.
    [166]M. J. Lighthill, G. B. Whitham. On the kinematic waves II:a theory of traffic flow on long crowded roads. Proceedings of the Royal Society A:Mathematical and Physicai Sciences,London, UK,1955:317-345.
    [167]侯忠生,金尚泰,赵明.宏观交通流模型参数的迭代学习辨识方法[J].自动化学报,2008,34(1):65-70.
    [168]王新洲.非线性模型参数估计理论与应用[M].武汉,武汉大学出版社.2004:52-73.
    [169]么健石,侯样林,徐心和.丛于动态设计变最优化算法的非线性模型参数估计[J].东北大学学报(自然科学版),2004,25(2):106-109.
    [170]X. Liu, S. N. Balakrishnan. Convergence analysis of adaptive critic based optimal control. Proceedings of American control conference,2000:1929-1933.
    [171]V. A. Bart, R. K. Howard. Recent Advances and Applications in the Field of Short-term Traffic Forecasting. International Journal of Forecasting,1997,13(1):1-12.
    [172]刘静,关伟.交通流预测方法综述.公路交通科技,2004,21(3):82-85.
    [173]史其信,郑为中.道路网短期交通流预测方法比较.交通运输工程学报.2004,4(4):68-71.
    [174]李星毅,基于相似性的交通流分析方法,北京:北京交通大学,博士论文,2010.
    [175]王正武,黄中祥,短时交通流预测模型的分析与评价,系统工程,2003,21(6):97-100.
    [176]F. V. Webster. Traffic signal settings. Roads Research Laboratory, London, U.K., Road Research Tec,1958, Paper No.39.
    [177]A. D. May. Traffic flow theory-the traffic engineer's challenge. Proceedings of the Institute of Transportation Engineers,1965,290-303.
    [178]R. E. Allsop. Delay at a fixed time traffic signal I:theoretical analysis. Transportation Science, 1972,6(3):260-285.
    [179]W. B. Cronje. Optimization model for isolated signalized traffic intersections. Transportation Research record,1983,905:80-83.
    [180]S. M. Elahi, A. E. Radwan, and K.M.Goul.Knowledge-based system for adaptivetraffic signal control. Transportation Research record,1991,1324:115-122.
    [181]P. G. Michalopoulos and G. Stephanopoulos. Optimum control of oversaturated intersections: Theoretical and practical consideration. Traffic Engineering and Control,1978,19(5): 216-221.
    [182]T. H. Chang and J. Y. Lin. Optimal signal timing for an oversaturated intersection. Transportation Research Part B,2000,34:471-491.
    [183]赵欢.单信号交叉口绿灯时间无模型自适应控制.硕士学位论文,北京交通大学,2009.
    [184]贾琰.基于近似动态规划的交通控制算法的研究.硕士学位论文,北京交通大学,2008.
    [185]蒲琪,谭永朝,杨超.交义口信号配时优化模型研究.上海铁道大学学报,1999,20(4):4-6.
    [186]H. M. Zhang and S. G. Ritchie. Freeway ramp metering using artificial neural network. Transportation Research Part C,1997,5(5):273-286.
    [187]H. M. Zhang, S. G. Ritchie, and R.Jayakrishnan. Coordinated traffic responsive ramp control via nonlinear state feedback. Transportation Research Part C,2001,9:337-352.
    [188]Z. S. Hou, J. X. Xu. Freeway traffic density control using iterative learning control approach. Proceedings of the IEEE 6th International Conference On Intelligent Transportation Systems, 2003,1081-1086.
    [189]M. J. Lighthill, G. B. Whitham. On the kinematic waves II:A theory of traffic flow on long crowed roads. Proceedings Of Royal Society of London, Series,A,1955,229(1178):317-345.
    [190]王春蔚,基于无模型自适应控制理论的局部入口匝道控制方法研究,硕士学位论文,北京交通大学,2005.
    [191]金尚泰.无模型学习自适应控制的若干问题研究及应用.博士学位论文,北京交通大学,2008.

© 2004-2018 中国地质图书馆版权所有 京ICP备05064691号 京公网安备11010802017129号

地址:北京市海淀区学院路29号 邮编:100083

电话:办公室:(+86 10)66554848;文献借阅、咨询服务、科技查新:66554700