MIT开放课程Dynamic Programming Lecture (7)
更新时间:2023-04-21 17:26:01 阅读量: 实用文档 文档下载
- mit开放课程网站推荐度:
- 相关推荐
6.231DYNAMICPROGRAMMING
LECTURE7
LECTUREOUTLINE
Deterministiccontinuous-timeoptimalcontrol Examples
Connectionwiththecalculusofvariations
TheHamilton-Jacobi-Bellmanequationasacontinuous-timelimitoftheDPalgorithm
TheHamilton-Jacobi-Bellmanequationasasuf- cientcondition
Examples
PROBLEMFORMULATION
Wehaveacontinuous-timedynamicsystem x˙(t)=fx(t),u(t),0≤t≤T,x(0):given,where
x(t)∈ nisthestatevectorattimet
u(t)∈U misthecontrolvectorattimet,Uisthecontrolconstraintset
Tistheterminaltime. Anyadmissiblecontroltrajectoryu(t)|t∈[0,T] (piecewisecontinuousfunctionu(t)|t∈[0,T]withu( t)∈Uforallt ∈[0,T]),uniquelydeter-minesx(t)|t∈[0,T]. Findanadmissiblecontroltrajectoryu(t)|t∈ [0,T] andcorrespondingstatetrajectoryx(t)|t∈
[0,T],thatminimizesacostfunctionoftheform
hx(T)+
0T gx(t),u(t)dt
f,h,gareassumedcontinuouslydifferentiable.
Motioncontrol:Aunitmassmovesonalineunderthein uenceofaforceu. x(t)=x1(t),x2(t):positionandvelocityofthemassattimet Problem:Fromagivenx1(0),x2(0),bringthemass“near”agiven nalposition-velocitypair(1,2)attimeTinthesense:
2 2 minimize x1(T) 1 + x2(T) 2
subjecttothecontrolconstraint
|u(t)|≤1,forallt∈[0,T].
Theproblem tstheframeworkwith
x˙1(t)=x2(t),
x˙2(t)=u(t), 2 2 hx(T)= x1(T) 1 + x2(T) 2 ,
gx(t),u(t)=0,forallt∈[0,T].
Aproducerwithproductionratex(t)attimetmayallocateaportionu(t)ofhis/herproductionratetoreinvestmentand1 u(t)toproductionofastorablegood.Thusx(t)evolvesaccordingto
x˙(t)=γu(t)x(t),
whereγ>0isagivenconstant.
Theproducerwantstomaximizethetotalamountofproductstored
0T 1 u(t)x(t)dt
subjectto
0≤u(t)≤1,forallt∈[0,T].
Theinitialproductionratex(0)isagivenpositivenumber.
EXAMPLEIII(CALCULUSOFVARIATIONS)
Point Findacurvefromagivenpointtoagivenlinethathasminimumlength.
Theproblemis
minimize
0T 21+x˙(t)dt
subjecttox(0)=α.
Reformulationasanoptimalcontrolproblem:
minimize
0T 21+u(t)dt
subjecttox˙(t)=u(t),x(0)=α.
Wediscretize[0,T]attimes0,δ,2δ,...,Nδ,whereδ=T/N,andwelet
xk=x(kδ),uk=u(kδ),k=0,1,...,N. Wealsodiscretizethesystemandcost:xk+1=xk+f(xk,uk)·δ,h(xN)+N 1
k=0g(xk,uk)·δ.
WewritetheDPalgorithmforthediscretizedproblem (Nδ,x)=h(x),J (k+1)·δ,x+f(x,u)·δ. (kδ,x)=ming(x,u)·δ+JJu∈U
isdifferentiableandTaylor-expand: AssumeJ
J(kδ,x)=ming(x,u)·δ+J(kδ,x)+ tJ(kδ,x)·δ
u∈U
+ xJ(kδ,x)f(x,u)·δ+o(δ).
LetJ (t,x)betheoptimalcost-to-goofthecon-tinuousproblem.Assumingthelimitisvalidk→∞,δ→0,kδ=tlim (kδ,x)=J (t,x),Jforallt,x,
weobtainforallt,x, 0=ming(x,u)+ tJ (t,x)+ xJ (t,x) f(x,u)u∈U
withtheboundaryconditionJ (T,x)=h(x).
ThisistheHamilton-Jacobi-Bellman(HJB)equa-tion–apartialdifferentialequation,whichissat-is edforalltime-statepairs(t,x)bythecost-to-gofunctionJ (t,x)(assumingJ isdifferentiableandtheprecedinginformallimitingprocedureisvalid). ItishardtotellaprioriifJ (t,x)isdifferentiable. SoweusetheHJBEq.asaveri cationtool;ifwecansolveitforadifferentiableJ (t,x),then: J istheoptimal-cost-to-gofunction
Thecontrolµ (t,x)thatminimizesintheRHSforeach(t,x)de nesanoptimalcontrol
VERIFICATION/SUFFICIENCYTHEOREM SupposeV(t,x)isasolutiontotheHJBequa-tion;thatis,Viscontinuouslydifferentiableintandx,andissuchthatforallt,x,
0=ming(x,u)+ tV(t,x)+ xV(t,x) f(x,u),u∈U
V(T,x)=h(x),forallx.
Supposealsothatµ (t,x)attainstheminimumaboveforalltandx. Letx (t)|t∈[0,T]andu (t)=µ t,x (t),t∈[0,T],bethecorrespondingstateandcontroltrajectories.
Then
V(t,x)=J (t,x),
forallt,x, andu (t)|t∈[0,T]isoptimal.
PROOF
Let{( u(t),x (t))|t∈[0,T]}beanyadmissiblecontrol-statetrajectory.Wehaveforallt∈[0,T]
˙(t)=fxUsingthesystemequationx (t),u (t),
theRHSoftheaboveisequalto
d V(t,x (t))gx (t),u (t)+dt
Integratingthisexpressionovert∈[0,T],
0≤
0T0≤gx (t),u (t)+ tVt,x (t)+ xVt,x (t)fx (t),u (t). gx (t),u (t)dt+VT,x (T) V0,x (0).
UsingV(T,x)=h(x)andx (0)=x(0),wehave T V0,x(0)≤hx (T)+gx (t),u (t)dt.
Ifweuseu (t)andx (t)inplaceofu (t)andx (t),theinequalitiesbecomesequalities,and T gx (t),u (t)dt.V0,x(0)=hx (T)+
EXAMPLEOFTHEHJBEQUATION
Considerthescalarsystemx˙(t)=u(t),with|u(t)|≤ 21andcost(1/2)x(T).TheHJBequationis
0=min tV(t,x)+ xV(t,x)u,|u|≤1forallt,x,
withtheterminalconditionV(T,x)=(1/2)x2. Evidentcandidateforoptimality:µ (t,x)= sgn(x).Correspondingcost-to-go 21J (t,x)=max0,|x| (T t). WeverifythatJ solvestheHJBEq.,andthatu= sgn(x)attainstheminintheRHS.Indeed, tJ (t,x)=max0,|x| (T t),
xJ (t,x)=sgn(x)·max0,|x| (T t).
Substituting,theHJBEq.becomes
0=min1+sgn(x)·umax0,|x| (T t)|u|≤1
LINEARQUADRATICPROBLEM
Considerthen-dimensionallinearsystem
x˙(t)=Ax(t)+Bu(t),
andthequadraticcost
x(T) QTx(T)+
0T x(t) Qx(t)+u(t) Ru(t)dt
TheHJBequationis
0=minxQx+uRu+ tV(t,x)+ xV(t,x)(Ax+Bu),mu∈
withtheterminalconditionV(T,x)=x QTx.Wetryasolutionoftheform
V(t,x)=x K(t)x,K(t):n×nsymmetric,andshowthatV(t,x)solvestheHJBequationif˙(t)= K(t)A A K(t)+K(t)BR 1B K(t) QK
withtheterminalconditionK(T)=QT.
正在阅读:
MIT开放课程Dynamic Programming Lecture (7)04-21
七年级语文下册第11课人民英雄永垂不朽练习(含答案)2苏教版word04-18
锅炉给水泵试运转方案07-21
物象的多视觉表达(教案设计)09-03
天津市气象条例08-14
关于抵制和防范境内外敌对势力利用宗教向校园渗透管理办法06-27
高速公路桥梁加固处治探讨07-28
临床三基试卷1104-14
云南省城市总体规划实施评估和监督检查工作办法(公示稿)04-10
20XX年3月消防安全工作计划12-11
- 1MIT考试题
- 2Lecture 13 Religion
- 3Evaluating a new programming language
- 4DYNAMIC PATH-PLANNING FOR SEARCH AND DESTROY
- 5Lecture 2
- 6Slides_Lecture1
- 7lecture 1-3
- 8Programming Actionscript 3.0 中文版
- 9UNIVERSITY OF SOUTHAMPTON Dynamic Discovery, Creation and In
- 10Integrating Dynamic Deformations into Interactive Volume
- 教学能力大赛决赛获奖-教学实施报告-(完整图文版)
- 互联网+数据中心行业分析报告
- 2017上海杨浦区高三一模数学试题及答案
- 招商部差旅接待管理制度(4-25)
- 学生游玩安全注意事项
- 学生信息管理系统(文档模板供参考)
- 叉车门架有限元分析及系统设计
- 2014帮助残疾人志愿者服务情况记录
- 叶绿体中色素的提取和分离实验
- 中国食物成分表2020年最新权威完整改进版
- 推动国土资源领域生态文明建设
- 给水管道冲洗和消毒记录
- 计算机软件专业自我评价
- 高中数学必修1-5知识点归纳
- 2018-2022年中国第五代移动通信技术(5G)产业深度分析及发展前景研究报告发展趋势(目录)
- 生产车间巡查制度
- 2018版中国光热发电行业深度研究报告目录
- (通用)2019年中考数学总复习 第一章 第四节 数的开方与二次根式课件
- 2017_2018学年高中语文第二单元第4课说数课件粤教版
- 上市新药Lumateperone(卢美哌隆)合成检索总结报告
- Programming
- Dynamic
- Lecture
- 课程
- 开放
- MIT
- 2011年2月23日北京国税和湖北国税面试真题
- 道路工程设计说明02
- 2015年广东事业单位面试备考:如何三天让面试无忧?
- 精炼棕仁油项目可行性研究报告备案用(专家版)
- 考研生理学课件---能代(吴)
- paradoxes with positive and mixed consequences
- 2012年重庆市中考语文古诗文复习篇目
- 复变函数与积分变换第十讲
- 3d开奖结果自动更新
- 一命二运三风水四积阴德五读书
- 2012江苏省一级助理电子商务师考试题库
- 第六章 知识的学习
- 梦游天姥吟留别导学案
- 凤仙花的发芽与生长的观察记录表
- 可编程脉冲信号发生器的设计
- 2013-2014年中国科学引文数据库CSCD核心库(C库)
- 区域性防突治理技术方案
- 挖掘机主减速器箱体加工工艺及其工装设计说明书
- 电机控制技术-绪论-第一章之1.3
- 【七年级上册数学精品优质课件】第一章有理数总复习