期刊网_中国期刊网

Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning

简介：Aprimarychallengeofagent-basedpolicylearningincomplexanduncertainenvironmentsisescalatingcomputationalcomplexitywiththesizeofthetaskspace(actionchoicesandworldstates)andthenumberofagents.Nonetheless,thereisampleevidenceinthenaturalworldthathigh-functioningsocialmammalslearntosolvecomplexproblemswithease,bothindividuallyandcooperatively.Thisabilitytosolvecomputationallyintractableproblemsstemsfrombothbraincircuitsforhierarchicalrepresentation...
标签：分散的 Markov 决定过程加强学习 Multiagent 系统

全文阅读

Stable reinforcement learning with recurrent neural networks

简介：Inthispaper,wepresentatechniqueforensuringthestabilityofalargeclassofadaptivelycontrolledsystems.WecombineIQCmodelsofboththecontrolledsystemandthecontrollerwithamethodoffilteringcontrolparameterupdatestoensurestablebehaviorofthecontrolledsystemunderadaptationofthecontroller.Wepresentaspecificapplicationtoasystemthatusesrecurrentneuralnetworksadaptedviareinforcementlearningtechniques.Theworkpresentedextendsearlierworksonstablereinforce...
标签：稳定性分析不可分的二次的限制周期性的神经网络加强学习线性矩阵不平等

全文阅读

Ballistic learning control： formulation, analysis and convergence

简介：Inthispaper,weformulateandexplorethecharacteristicsofiterativelearninginballisticcontrolproblems.Theiterativelearningcontrol（ILC）theoryprovidesasuitableframeworkforderivationsandanalysisofballisticcontrolunderlearningprocess.Toovercometheobstaclescausedbyuncertaingradientandredundantcontrolinput,weincorporateextratrialsintoiterativelearning.Withthehelpoftrialresults,propercontrolandupdatingdirectioncanbedetermined.Then,iterativelearningcanbeappliedtoballisticcontrolproblem.Severalinitialstatelearningalgorithmsarestudiedforinitialspeedcontrol,forcecontrol,aswellascombinedspeedandanglecontrol.Intheend,shootinganglelearninginthebasketballshotprocessissimulatedtoverifytheeffectivenessofiterativelearningmethodsinballisticcontrolproblems.
标签： BALLISTIC CONTROL Iterative LEARNING CONTROL Initial

全文阅读

Electrocardiogram （ECG） pattern modeling and recognition via deterministic learning

作者： Xunde DONG;Cong WANG;Junmin HU;Shanxing OU
学科：自动化与计算机技术 > 控制理论与控制工程
创建时间：2014-04-14
出处：《控制理论与技术：英文版》 2014年第4期

简介：模式经由确定的学习理论当模特儿和识别在这篇论文被介绍的为心电图(ECG)的一个方法。而不是认识到ECG表明beat-to-beat，包含很多心跳的每个ECG信号被认出。方法完全基于时间的特征(即，动力学)ECG模式，它包含ECG模式的完全的信息。一个动态模特儿被雇用表明方法，它能够产生合成ECG信号。基于动态模型，方法在下列二个阶段被显示出：鉴定(训练)阶段和识别(测试)分阶段执行。在鉴定阶段，ECG模式的动力学精确地被建模并且通过确定的学习表示了为经常的RBF神经重量。在识别阶段，当模特儿的结果被用于ECG模式识别。建议方法的主要特征是ECG模式的动力学精确地被当模特儿并且被用于ECG模式识别。用Physikalisch-TechnischeBundesanstalt(PTB)数据库的试验性的研究被包括表明途径的有效性。
标签： ECG信号识别方法精确建模心电图学习模式识别

全文阅读

Asymptotic tracking by a reinforcement learning-based adaptive critic controller

作者： Shubhendu BHASIN;Nitin SHARMA;Parag PATRE;Warren DIXON
学科：自动化与计算机技术 > 控制理论与控制工程
创建时间：2011-03-13
出处：《控制理论与技术：英文版》 2011年第3期

简介：Adaptivecritic(AC)basedcontrollersaretypicallydiscreteand/oryieldauniformlyultimatelyboundedstabilityresultbecauseofthepresenceofdisturbancesandunknownapproximationerrors.Acontinuous-timeACcontrollerisdevelopedthatyieldsasymptotictrackingofaclassofuncertainnonlinearsystemswithboundeddisturbances.TheproposedAC-basedcontrollerconsistsoftwoneuralnetworks(NNs)-anactionNN,alsocalledtheactor,whichapproximatestheplantdynamicsandgeneratesappropriatecon...
标签：适应批评家加强学习神经基于网络的控制

全文阅读

Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization

作者： Amanda Lampton;John Valasek;Mrinal Kumar
学科：自动化与计算机技术 > 控制理论与控制工程
创建时间：2011-03-13
出处：《控制理论与技术：英文版》 2011年第3期

简介：Amultiresolutionstate-spacediscretizationmethodwithpseudorandomgriddingisdevelopedfortheepisodicunsupervisedlearningmethodofQ-learning.Itisusedasthelearningagentforclosed-loopcontrolofmorphingorhighlyreconfigurablesystems.Thispaperdevelopsamethodwherebyastate-spaceisadaptivelydiscretizedbyprogressivelyfinerpseudorandomgridsaroundtheregionsofinterestwithinthestateorlearningspaceinanefforttobreaktheCurseofDimensionality.Utilityofthemethodi...
标签：加强学习 MORPHING 随机的格子

全文阅读

Quantum learning control using differential evolution with equally-mixed strategies

作者： Hailan MA;Daoyi DONG;Chuan-Cun SHU;Zhangqing ZHU;Chunlin CHEN
学科：自动化与计算机技术 > 控制理论与控制工程
创建时间：2017-03-13
出处：《控制理论与技术：英文版》 2017年第3期

简介：学习控制在量信息技术作为一条强大的途径被认出了。在这份报纸，我们扩大微分进化(DE)的应用程序为各种各样的量系统设计最佳的控制。各种各样的DE方法被介绍并且分析，并且在同等地混合的策略展示的EMSDE为量控制被采用。量的二个类控制问题，包括四水平的开的量整体和量的控制超导的系统，被调查为听说控制量系统表明EMSDE的性能。数字结果为各种各样的量系统验证EMSDE方法的有效性并且为复杂的量控制问题显示出潜力。
标签：量子信息技术学习控制混合策略差分进化开放量子系统控制问题

全文阅读

Fault detection for nonlinear discrete-time systems via deterministic learning

简介：最近，为小摆动差错的快速的察觉的一条途径为连续时间的系统基于确定的学习理论被建议。在这份报纸，一个差错察觉计划经由确定的学习为非线性的分离时间的系统的一个班被建议。由使用确定的学习算法的分离时间的扩展，一般差错工作(即，内部动力学)内在的正常和非线性的分离时间的系统的差错模式被分离时间的动态光线的基础功能(RBF)局部地精确地接近网络。然后，有嵌入的系统动力学的获得的知识的评估者的一个银行被构造，并且一套剩余被获得并且过去常测量监视系统的动力学和训练系统的动力学之间的差别。一个差错察觉决定计划根据最小的剩余原则被介绍，即，一个差错的出现能被比较剩余的大小在一个分离时间的背景检测。差错detectability分析被执行，察觉时间的上面的界限被导出。一个模拟例子被给说明建议计划的有效性。
标签：非线性离散时间系统故障检测学习算法连续时间系统快速检测方法径向基函数

全文阅读

Optimal higher order learning adaptive control approach for a class of SISO nonlinear systems

简介：Inthispaper,anoptimalhigherorderlearningadaptivecontrolapproachisdevelopedforaclassofSISOnonlinearsystems.Thisdesignismodel-freeanddependsdirectlyonpseudo-partial-derivativesderivedon-linefromtheinputandoutputinformationofthesystem.Anovelweightedone-step-aheadcontrolcriterionfunctionisproposedforthecontrollaw.Theconvergenceanalysisshowsthattheproposedcontrollawcanguaranteetheconvergenceundertheassumptionthatthedesiredoutputisasetpoint.Simulationexamplesareprovidedfornonlinearsystemstoillustratethebetterperformanceofthehigherorderlearningadaptivecontrol.
标签：知识系统自适应控制非线性系统优化设计

全文阅读

LMI-based robust iterative learning controller design for discrete linear uncertain systems

简介：Thispaperaddressesthedesignproblemofrobustiterativelearningcontrollersforaclassoflineardiscrete-timesystemswithnorm-boundedparameteruncertainties.Aniterativelearningalgorithmwithcurrentcyclefeedbackisproposedtoachievebothrobustconvergenceandrobuststability.Thesynthesisproblemoftheproposediterativelearningcontrol(ILC)systemisreformulatedasaγ-suboptimalH-infinitycontrolproblemviathelinearfractionaltransformation(LFT).AsufficientconditionfortheconvergenceoftheILCalgorithmispresentedintermsoflinearmatrixinequalities(LMIs).Furthermore,thelineartransferoperatorsoftheILCalgorithmwithhighconvergencespeedareobtainedbyusingexistingconvexoptimizationtechniques.Thesimulationresultsdemonstratetheeffectivenessoftheproposedmethod.
标签：知识系统迭代控制离散系统线性控制线性矩阵不等式

全文阅读

Bionic autonomous learning control of a two-wheeled self-balancing flexible robot

简介：这份报纸论述OCPA(发生作用之人或物调节概率的自动机)仿生的自治学习系统基于为解决一个二车轮的灵活机器人的平衡控制问题的Skinners发生作用之人或物调节理论。学习系统的OCPA由二个阶段组成：在第一个阶段，一个发生作用之人或物行动从一套发生作用之人或物行动随机地被选择然后用作控制系统的输入；在第二个阶段，学习系统收集系统的取向信息并且为优化使用它直到完成控制目标。同时，发生作用之人或物行动集合的尺寸能自动地为避免小概率事件在学习过程期间被减少。理论分析为在纸学习系统的设计OCPA被做，它理论上证明在学习系统的OCPA学习机制的发生作用之人或物调节的集中，也就是，发生作用之人或物行动熵将与学习进程收敛到最小。然后OCPA学习系统被用于姿势二车轮的灵活自我平衡的机器人的平衡控制。机器人不把posutre平衡处于起始的状态和在发生作用之人或物集合的每发生作用之人或物的选择概率的技巧是相等的。与学习继续，最佳的发生作用之人或物的选择可能性逐渐地逐渐地趋于到一个和发生作用之人或物行动熵趋于到最小，并且机器人逐渐地因此学习了姿势平衡技巧。
标签：柔性机器人平衡控制学习控制仿生概率自动机可操作性

全文阅读

Filter-based　iterative　learning　control　for　linear　large-scale　industrial　processes

简介：Intheprocedureofthesteady-statehierarchicaloptimizationwithfeedbackforlarge-scaleindustrialprocesses,asequenceofset-pointchangeswithdifferentmagnitudesiscarriedoutontheoptimizationlayer.Toimprovethedynamicperformanceoftransientresponsedrivenbytheset-pointchanges,afilter-basediterativelearningcontrolstrategyisproposed.Intheproposedupdatinglaw,alocal-symmetric-integraloperatorisadoptedforeliminatingthemeasurementnoiseofoutputinformation,asetofdesiredtrajectoriesarespecifiedaccordingtotheset-pointchangessequence,thecurrentcontrolinputisiterativelyachievedbyutilizingsmoothedoutputerrortomodifyitscontrolinputatpreviousiteration,towhichtheamplifiedcoefficientsrelatedtothedifferentmagnitudesofset-pointchangesareintroduced.Theconvergenceofthealgorithmisconductedbyincorporatingfrequency-domaintechniqueintotime-domainanalysis.Numnericalsimulationdemonstratestheeffectivenessoftheproposedstrategy.
标签：迭代学习控制大工业加工线性系统动态特性

全文阅读

Performance index based learning controls for the partial non-regular systems using lifting technique

作者： Shengyue YANG;Xiaoping FAN;Zhihua QU
学科：自动化与计算机技术 > 控制理论与控制工程
创建时间：2009-02-12
出处：《控制理论与技术：英文版》 2009年第2期

简介：为非常规的系统的基于表演的反复的学习控制(ILC)的缺乏详细被调查，然后，更新的更快的控制输入和上升技术在性能的设计被介绍为部分非常规的系统索引基于的ILC。基于不同性能，索引被考虑的二种最佳的ILC。最后，模拟例子被给说明建议学习控制的可行性。
标签：迭代学习控制正规系统性能指标提升技术基础国际法

全文阅读

Quasi-Newton-type optimized iterative learning control for discrete linear time invariant systems

简介：在这份报纸，quasi-Newton-type优化了控制(ILC)算法为分离线性时间不变的系统的一个类被调查的反复的学习。建议学习算法是由一个quasi-Newton-type矩阵更新学习获得矩阵而不是植物的倒置。借助于数学归纳方法，建议算法的单调集中被分析，它证明追踪的错误单调地在重复的一个有限数字以后收敛到零。与存在相比优化了ILC算法，由于伪的superlinear集中--牛顿方法，建议学习法律与更快的会聚的率操作并且对柔韧系统模型有病条件，并且因此拥有大量应用。数字模拟表明有效性和有效性。
标签：线性时不变系统迭代学习控制拟牛顿法优化离散学习算法

全文阅读

Feedback iterative learning control for time-delay systems based on 2D analysis approach

作者： Deyuan MENG;Yingmin JIA;Junping DU;Shiying YUAN
学科：自动化与计算机技术 > 控制理论与控制工程
创建时间：2010-04-14
出处：《控制理论与技术：英文版》 2010年第4期

简介：Thispaperdealswiththeiterativelearningcontrol(ILC)designformultiple-inputmultiple-output(MIMO),time-delaysystems(TDS).TwofeedbackILCschemesareconsideredusingtheso-calledtwo-dimensional(2D)analysisapproach.Itshowsthatcontinuous-discrete2DRoessersystemscanbedevelopedtodescribetheentirelearningdynamicsofbothILCschemes,basedonwhichnecessaryandsufficientconditionsfortheirstabilitycanbeprovided.Anumericalexampleisincludedtovalidatethetheoreticalanalysis.
标签：迭代学习控制二维分析时滞系统反馈多输入多输出时间延迟系统

全文阅读

On iterative learning control design for tracking iteration-varying trajectories with high-order internal model

作者： Chenkun YIN;Jianxin XU;Zhongsheng HOU
学科：自动化与计算机技术 > 控制理论与控制工程
创建时间：2010-03-13
出处：《控制理论与技术：英文版》 2010年第3期

简介：Inthispaper,iterativelearningcontrol(ILC)designisstudiedforaniteration-varyingtrackingprobleminwhichreferencetrajectoriesaregeneratedbyhigh-orderinternalmodels(HOIM).AnHOIMformulatedasapolynomialoperatorbetweenconsecutiveiterationsdescribesthechangesofdesiredtrajectoriesintheiterationdomainandmakestheiterativelearningproblembecomeiterationvarying.TheclassicalILCfortrackingiteration-invariantreferencetrajectories,ontheotherhand,isaspecialcaseofHOIMwherethepolynomialrenderstoaunitycoefficientoraspecialfirst-orderinternalmodel.ByinsertingtheHOIMintoP-typeILC,thetrackingperformancealongtheiterationaxisisinvestigatedforaclassofcontinuous-timenonlinearsystems.Time-weightednormmethodisutilizedtoguaranteevalidityofproposedalgorithminasenseofdata-drivencontrol.
标签：迭代学习控制控制设计跟踪问题内模高阶内部模型

全文阅读

Robust adaptive repetitive learning control for a class of time-varying nonlinear systems with unknown control direction

简介：Arobustadaptiverepetitivelearningcontrolmethodisproposedforaclassoftime-varyingnonlinearsystems.Nussbaum-gainmethodisincorporatedintothecontroldesigntocounteractthelackofaprioriknowledgeofthecontroldirectionwhichdeterminesthemotiondirectionofthesystemunderanyinput.Itisshownthatthesystemstatecouldconvergetothedesiredtrajectoryasymptoticallyalongtheiterationaxisthroughrepetitivelearning.Simulationiscarriedouttoshowthevalidityoftheproposedcontrolmethod.
标签： REPETITIVE learning CONTROL (RLC) UNKNOWN CONTROL

全文阅读

Some new criteria for lag synchronization of chaotic Lur＇e systems by replacing variables control

简介：Somenewcriteriaforthechaoticlagsynchronizationareproposed.Atfirst,lagsynchronizationschemeforidenticalmaster-slaveLur'esystemsbyreplacingvariablescontrolandtherelevanterrorsystemaregiven,andtherelationsbetweenabsolutestabilityoftheerrorsystemandthechaoticlagsynchronizationaredescribed.Then,basedonaquadraticLyapunovfunction,twonewLur'ecriteriafortheabovechaoticlagsynchronizationareproved.FourcorrespondingfrequencydomaincriteriaarefurtherderivedbymeansofMeyer-Kalman-YacuboviaLemma.Thesefrequencydomaincriteriaareappliedtoanalyzethelagsynchronizationofgeneralmaster-slaveChua'scircuitssothatsomerangesoftheparametersinwhichthemaster-slaveChua'scircuitsachievechaoticlagsynchronizationbyreplacingsingle-variablecontrolareattained.Finally,someexamplesaregiventoverifythetheoreticalresults.
标签：混沌同步变换控制 Lur'e系统延迟同步

全文阅读

On　absolute　stability　of　Lur＇e　control　systems　with　multiple　non-linearities　using　linear　matrix　inequalities

简介：NecessaryandsuffcientconditionsfortheexistenceofaLyapunovfunctionintheLur'eformtoguaranteetheabsolutestabilityofLur'econtrolsystemswithmultiplenon-linearitiesarediscussedinthispaper.Itsimplifiestheexistenceproblemtooneofsolvingasetoflinearmatrixinequalities(LMIs).IfthoseLMIsarefeasible,freeparametersintheLyapunovfunction,suchasthepositivedefinitematrixandthecoefficientsoftheintegralterms,aregivenbythesolutionoftheLMIs.Otherwise,thisLyapunovfunctiondoesnotexist.Somesufficientconditionsarealsoobtainedfortherobustabsolutestabilityofuncertainsystems.Anumericalexampleisprovidedtodemonstratetheeffectivenessoftheproposedmethod.
标签：线性矩阵控制系统绝对稳定性稳健性李雅普诺夫函数

全文阅读