书签分享收藏举报版权申诉 / 10

立即下载加入VIP,免费下载

当前位置：首页 > IT计算机 > 电脑基础知识 > 文献翻译.docx

文献翻译.docx

文档编号：1962884
上传时间：2023-05-02
格式：DOCX
页数：10
大小：24.63KB

文献翻译.docx

《文献翻译.docx》由会员分享，可在线阅读，更多相关《文献翻译.docx（10页珍藏版）》请在冰点文库上搜索。

文献翻译.docx

文献翻译

ContentslistsavailableatScienceDirect

目录列表可用在Elsevier是荷兰一家全球著名的学术期刊出版商

Neurocomputing

神经计算学

journalhomepage杂志主页

Set-labelmodelinganddeepmetriclearningonpersonre-identification

标签建模与深层度量学习方法在人体识别系统中的应用

刘昊:

a

马彭兵:

a

秦雷:

b,*

庞俊彪:

c

张春杰：

a

黄庆明：

a,b

a,中国学术科技大学计算机与控制工程中国背景100190

b,中国院士（CAS），中国科学院计算技术研究所，中科院，北京100190，中国的智能信息处理重点实验室

c,北京科技大学城市交通学院，多媒体与智能软件技术重点实验室。

articleinfo文章资讯

文章历史

Received接获时间2014年7月27日

接获修订格式2014年10月21日

通过2014年11月2日

由刘青山传达

上线2014年11月11日

关键字:

人重新定义

交互信息

量度学习

深度学习

领里成分分析

摘要

Personre-identificationaimsatmatchingindividualsacrossmultiplenon-overlappingadjacentcameras.。

人体身份识别的主要目的在于通过多个不重叠相邻摄像机匹配个人.

Bycondensingmultiplegalleryimagesofapersonasawhole,weproposeanovelmethodnamedSet-LabelModel（SLM）toimprovetheperformanceofpersonre-identificationunderthemulti-shotsetting.

通过筛检多余无关图库图片来突出一个人作为一个整体，我们在多重拍摄的背景下提出了一种新的方法并命名为标签集模型（SLM）用来提高人体识别的性能。

Moreover,weutilizemutual-informationtomeasuretherelevancebetweenqueryimageandgallerysets.

此外，我们利用交互信息来衡量查询图像和图库设置之间的相关性

Todecreasethecomputationalcomplexity,weapplyaNaive–BayesNearest-Neighboralgorithmtoapproximatethemutual-informationvalue.

为了降低计算的复杂性，我们采用一个朴素贝叶斯的临近算法去估算交互信息值。

Toovercomethelimitationsoftraditionallinearmetriclearning,wefurtherdevelopadeepnon-linearmetriclearning（DeepML）approachbasedonNeighborhoodComponentAnalysisandDeepBeliefNetwork.

为了克服传统的线性度量学习的局限性，我们进一步发展基于领域

构件分析和深度信念网络的深入的非线性度量学习方法。

Toevaluatetheeffectivenessofourproposedapproaches,SLMandDeepML,wehavecarriedoutextensiveexperimentsontwochallengingdatasetsi-LIDSandETHZ.

为了评估我们所提出的方案的有效性，标签集模型和非线性度量,我们已经在Linux入侵侦察系统和瑞士理工学院两个有权威性的数据集上进行大量的实验。

Theexperimentalresultsdemonstratethattheproposedmethodscanobtainbetterperformancescomparedwiththestate-of-the-artmethods.

实验结果表明，与最先进的方法相比我们所提出的方法可以得到更好的性能。

Intherecentyears,thetaskofpersonre-identification（Re-Id）isbecominglargelyattractiveinvideosurveillance.Itaimstomatchpeopleacrossmultiplenon-overlappingcameras,forexample,identifypeopleacrossmulti-viewcamerasinthemulticameranetwork,orrecognizetheidenticalpersonwhodisappearedinonecameraandappearedinanothercameralater.Italsocanbeembeddedinwidespreadapplicationssuchastrackingandtargetre-acquisition.

在最近几年，人体识别的课题在视频监控上引起了很大的关注。

它的主要目的在于通过多个不重叠相邻摄像机匹配人体.例如，在多相机网络下通过多试图摄像机，或是识别消失在一台摄像机和出现在另一台摄像机后的同一个人。

它也可以被嵌套在广泛的应用程序中比如跟踪和目标重采集。

Accordingtotheexperimentalsetting,themethodsofRe-Idcanbedividedintotwogroups,single-shotandmulti-shot.Theformergroupselectsonlyoneimageforeachperson,whilethelattergroupdescribesmultipleimagesasasignatureforeachpersonId（classlabel）.Re-Idisachallengingproblem,sinceitsuffersilluminationchanges,low-resolution,andviewvariationsinmultiplecameras.Forrecentbesteffortsfromresearchers,onekindoftheRe-Idmethodsfocusesondesigningdiscriminativefeatures[1–7].Byutilizingthesupervisedinformation,theotherkindofmethodsaimsatfindingagloballineartransformationtoreweightfeaturedimensions（e.g.learningaMahalanobisdis-tance）[8–11].

根据实验设定，人体识别的方法可以被分为两组，单次拍摄和多次拍摄。

前一组每人只选择着一张图片，而后一组描述多个图片为每一个人做一个标记（标签集）。

人体识别是一个具有挑战性的问题，因为它受到光照变化，低分辨率和查看变化多台摄像机的影响。

最近研究人员做了最大的努力，一种集中设计面貌差异的人体识别方法。

通过采用监控信息，得出是一种找出一个全局性的线性转换加权特征尺寸（e.learning马氏距离）的另一种方法

Inthispaper,wefirstproposeaSet-LabelModelnamedSLMapproachtoimprovetheperformanceofpersonre-identificationunderthemulti-shotsetting.TherearethreestepsforSLM.Firstly,wedefineaset-basedstructureforeachclass,whichcontainsconcatenatedfeaturesbetweenthequeryfeatureandthegalleryfeatures.Inthefollowing,weuseSETtoreplacetheset-basedstructureforsimplicity.Weutilizemutual-informationtomeasuretherelationshipbetweenfeaturesw.r.ttheirclasslabel.Secondly,sincethefeaturesdistributionofconditionalprobabilitycanbehardlyassumed,weapplyaNaive–BayesNearest-Neighboralgorithm（NBNN）[12]toapproximatethemutual-informationvalueinsteadofdirectlyaccessingtheprobabilityform.Inaddition,theNBNNalgorithmcanalsoprovideasignificantefficiency.Finally,themutual-informationvaluescanberankedinadescendingorderandthecorrespondingclasslabelofthehighestvalueisassignedtothequery.

在本文中，我们首先提出了一个名为SLM方法，为了在连续拍摄下

提高人体识别的性能。

SLM有三个步骤。

首先，我们为每个类定义一组基础结构，其中包含了查询功能和库功能之间的级联功能。

在下文中，为了简单起见我们使用SET，以取代该组组为基础的结构。

我们利用交互信息来衡量在特点W.R.T与它们的类标签之间的关系。

其次，由于几乎不能假设条件概率的特征。

我们应用一个朴素贝叶斯最临近算法（N-NN）[12]来近似多交互信息的值而不是直接访问他的可能性。

此外，朴素贝叶斯最临近算法也可以提供一个显著效率。

最后，交互信息值可以排在一个递减的次序和最高值中的对应类标签分配一个查询。

Byutilizingthelabeleddata,wefurtherdevelopadeepnonlinearmetriclearningmethodnamedDeepMLbasedonNeighborhoodComponentAnalysis（NCA）[13]andDeepBeliefNetwork（DBN）[14].NCAaimstomaximizetheexpectednumbersofclassifiedsampleintrainingdataviaadatatransformation.ByNCA,animprovementcanbeperformedonthealgorithms,whicharebasedoncomputingthedistanceoftwofeatures（suchask-nearest-neighborclassification）.ToextendthedatatransformationinNCA,weutilizeDBNtolearnanonlinearfeaturetransformation.NCAisplacedonthetoplayerofRBMtoadjusttheweightsoftoplayer.Thenfine-tuningiscarriedouttoadjusttheweightsofotherlayers.Bythisway,thediscriminativepoweroffeaturescanbeenhancedbythelearnednewmetric（transformation）.

利用标记的数据，我们进一步发展了命名为DeepML的基于领域构件分析的深度非线性度量学习方法和深信度网络，领域构件分析目的在于通过数据转换最大限度地训练数据的分类样本。

通过NCA

Therearetwomaincontributionsforthispaper.

（1）Wemodeltherelevanceamongmultipleimagefeaturesbymutual-information.Furthermore,weapplyanapproximatealgorithmNBNNtovaluethemutual-informationinsteadofdirectlyaccessingtheprobabilityform.Asourknowledge,itisthefirsttimethatmutual-informationtheoryisappliedinthetaskofpersonre-identification.

（2）Byconsideringthelabeledimages,wedevelopadeepnon-linearmetriclearningmethodtoimprovethediscriminativepowerofourfeatures.Asmostmetriclearningmethodsfocusonlearningalineartransformation,weapplydeeplearningarchitecturetoprovideanon-linearmappingfromtheoriginfeaturestonewnonlinearfeatures.

WeevaluateSLMandDeepMLontwobenchmarkdatasetsi-LIDSandETHZ,bothofwhichhavemultipleimagesperpersonandareundergoingthechangesofillumination,viewangle,low-resolutionandocclusion.TheexperimentalresultsdemonstratethatSLMcanobtain100percentmatchingaccuracywithsimplecolorfeatures（HSV）onETHZafterrank3,andDeepMLcangainadditionalimprovementsbycombiningSLMwithdeepnon-linearmetriclearning.

Theremainderofthispaperisorganizedasfollows:

relatedworksareintroducedinSection2.ThedetailsofSLMandDeepMLaredescribedinSection3.TheexperimentalperformanceandresultsarepresentedinSection4.Finally,wedrawsomeconclusionsandputforwardfutureworksinSection5.

2.Relatedwork

Recently,thetaskofpersonre-identification,aimingatmatchingthesameindividualacrossmultipledisjointcameras,hasobtainedincreasingattentioninvideosurveillance.ToimprovetheperformanceofRe-Id,existingworksmainlyfocusontwoaspects,appearancefeatureextractionanddistancemetriclearning.

Theappearancebasedmethodsmainlyrelyondesigningdescriptivefeaturessuchaslow-dimensionaldiscriminativefeatures[1],viewpointinvariancefeatures[2,15],accumulationofmultiplefeatures[3],combinationofbothlocalandglobalfeatures[4],bio-inspiredfeatures[5,7],discriminativefeaturesbyattributes[6]andFishervectorencodedfeatures[7].

Differentfromtheappearancebasedmethods,othermethodsinpersonre-identificationcaremoreabouthowtousethemetriclearningmethodtoimprovethemeasureofthefeatures[8–11,16,17].Byafeaturemapping,thesemethodsprojecttheoriginalfeaturesintoanotherfeaturespace.Thetraditionalmetriclearningapproachessuchas[16,17]aimtolearnanoptimaltransformationtoweightthefeatures.Bythistransformation,thetruematchesareclusteredcloser,andthefalsematchesarepushedfarther.Inthisway,abudgetofmetriclearningbasedmethodsspecificforRe-Idhasbeenproposed.Ref.[8]developsafastandscalablemethodoflearningmetricbyinferenceoflikelihoodratiotest.Ref.[11]formulatesRe-Idasadistancecomparisonlearningproblembymaximizingtheprobabilitybetweenatruematchandafalsematch.Mostofthesemethodsarelimitedtolearnjustalineartransformationtoprovideaprojectionfromthesourcefeaturespaceontothetargetspace.

近期,人体识别主要针对匹配同一个人在多个不相交的摄像头,因此视频监控已得到越来越多的关注。

提高人体识别的性能,现有的工作主要集中在两个方面,外观特征提取和距离度量学习。

　　基于外观的方法主要依赖于设计描述性特性,如低维区别的功能[1],角度不变性特征[2,15],积累多个特性[3],两者的结合局部和全局特征[4],[5、7]仿生特性,区别的特征,属性[6][7]和费舍尔矢量编码的特性。

　　不同于基于外观的方法,其他方法鉴定更关心如何使用度量学习方法提高测量的特性（8-11、16、17）。

通过特征映射,这些方法项目将原始特性映射到另一个特征空间。

传统度量学习方法（16、17）的目标是学习最优转换重量的特性。

这种正确的会转变地更紧凑,反之推得更远。

通过这种方式,预算指标为基础的学习方法的具体人体识别已提上日程。

文献[8]通过检验推论可快速发展，学习可伸缩的度规似然方法[11]人体识别的制定同距离的学习比较问题,是正确和错误匹配之间的最可能的最大化效率。

大多数这些方法是有限的学习，只是一个线性变换，提供了一个从源特征空间投影到目标空间。

Accordingtothewaysofverifications,Re-Idcanbegroupedintotwocategories:

single-shotandmulti-shot.Generally,tovalidatetheeffectivenessofaRe-Idapproach,weshouldrandomlychoosethesamenumberofimages（candidates）fromeachpersonfirst,andthengroupthesecandidateswiththeirlabelsasagalleryset.Inthesingle-shotsetting,thereisonlyoneimageforeachpersoninthegalleryset[1,2].Sincesingleimageofthetargetcanhardlycoverthechangesofmulti-pose,multi-cameraandillumination,thetraditionalapproacheshaveobtainedfewimprovementsunderthesingle-shotse