An Image Semantic Annotation Method That Models Continuous Visual Features

Li Zhixin (李志欣) 1, Shi Zhiping (施智平) 2, Liu Xi (刘曦) 3, Shi Zhongzhi (史忠植) 2

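Author affiliations:

1. Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; College of Computer Science and Information Engineering, Guangxi Normal University, Guilin 541004; Graduate University of Chinese Academy of Sciences, Beijing 100049

2. Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190

3. Key Laboratory of Intelligent Information Processing, Institute of Computing Technology, Chinese Academy of Sciences, Beijing 100190; Graduate University of Chinese Academy of Sciences, Beijing 100049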


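Abstract: To address the "semantic gap" problem in image retrieval, this paper proposes an automatic image annotation method that models continuous visual features directly. First, the probabilistic latent semantic analysis (PLSA) model is extended so that it can handle continuous quantities, and the corresponding expectation-maximization (EM) algorithm is derived to estimate the model parameters. Then, in view of the distinct characteristics of each data modality, an image semantic annotation model is proposed that processes the modalities separately: it models visual features with continuous PLSA and textual keywords with standard PLSA, and it learns the association between the two modalities through an asymmetric learning method, so that previously unseen images can be annotated well. Experiments on a standard Corel dataset of 5000 images, together with comparisons against several typical image annotation methods, show that the proposed method achieves higher precision and better annotation performance.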
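The abstract does not give the form of the continuous PLSA model or its EM updates, so the following is only a minimal sketch under stated assumptions, not the authors' derivation. It assumes that each latent aspect z emits a feature vector from a Gaussian with diagonal covariance and that each image d has its own mixing weights P(z|d). The function name `continuous_plsa_em`, its parameters, and the variance floor `eps` are hypothetical choices made for illustration.

```python
import numpy as np


def continuous_plsa_em(features, doc_ids, n_docs, n_aspects,
                       n_iter=50, seed=0, eps=1e-6):
    """Fit a Gaussian-aspect PLSA variant with EM (illustrative sketch).

    features: (N, D) array, one row per image region (assumed layout).
    doc_ids:  (N,) integer array, image index of each row; every image
              in range(n_docs) is assumed to own at least one row.
    """
    rng = np.random.default_rng(seed)
    n, _ = features.shape
    # Initialise aspect means at random data points and start every
    # aspect with the global per-dimension variance.
    means = features[rng.choice(n, size=n_aspects, replace=False)].copy()
    variances = np.tile(features.var(axis=0) + eps, (n_aspects, 1))
    p_z_given_d = np.full((n_docs, n_aspects), 1.0 / n_aspects)
    log_likelihoods = []

    for _ in range(n_iter):
        # E-step: P(z | d, x) is proportional to P(z | d) * N(x | mu_z, var_z).
        diff = features[:, None, :] - means[None, :, :]          # (N, K, D)
        log_pdf = -0.5 * (
            np.sum(diff ** 2 / variances[None], axis=2)
            + np.sum(np.log(2.0 * np.pi * variances), axis=1)[None, :]
        )                                                        # (N, K)
        log_joint = np.log(p_z_given_d[doc_ids] + 1e-300) + log_pdf
        log_norm = np.logaddexp.reduce(log_joint, axis=1, keepdims=True)
        resp = np.exp(log_joint - log_norm)
        log_likelihoods.append(float(log_norm.sum()))

        # M-step: responsibility-weighted Gaussian parameters per aspect.
        weight = resp.sum(axis=0) + 1e-300                       # (K,)
        means = (resp.T @ features) / weight[:, None]
        diff = features[:, None, :] - means[None, :, :]
        variances = (np.einsum("nk,nkd->kd", resp, diff ** 2)
                     / weight[:, None]) + eps
        # M-step: P(z | d) from responsibilities summed within each image.
        counts = np.zeros((n_docs, n_aspects))
        np.add.at(counts, doc_ids, resp)
        p_z_given_d = counts / counts.sum(axis=1, keepdims=True)

    return means, variances, p_z_given_d, log_likelihoods


if __name__ == "__main__":
    # Synthetic demo data: two clusters of 2-D "region features" in 6 images.
    demo_rng = np.random.default_rng(1)
    feats = np.vstack([demo_rng.normal(0.0, 1.0, size=(60, 2)),
                       demo_rng.normal(8.0, 1.0, size=(60, 2))])
    ids = np.repeat(np.arange(6), 20)
    _, _, p_zd, ll = continuous_plsa_em(feats, ids, n_docs=6, n_aspects=2)
    print(p_zd.round(3))
    print(ll[0], ll[-1])
```

In this sketch the learned rows of `p_z_given_d` play the role of a per-image aspect distribution. The asymmetric learning step that links visual aspects to keywords is not described in enough detail in the abstract to sketch here.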



Language: Chinese

Funding: National Basic Research Program of China (973 Program), project 2007CB311004

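Keywords: automatic image annotation; probabilistic latent semantic analysis; topic model; continuous visual features; image retrieval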

