Cluster Analysis of Chernoff Faces Based on the V-System
Abstract
The V-system is a complete orthogonal function system on L2[0,1] composed of piecewise polynomials; it contains both continuous and discontinuous functions. The degree-0 V-system is exactly the Haar wavelet system. A complex geometric model can be represented exactly by finitely many terms of a V-series. The V-system also has multiresolution and locality (local support) properties, which play an important role in computer graphics, computer vision, CAD/CAM, medical imaging, scientific computing, and related fields. This paper uses these properties to address the problem of clustering Chernoff faces.
     The Chernoff face is a classical method in multivariate statistics for displaying multivariate data graphically, and it is also an effective data visualization technique. In 1973 the American statistician Chernoff first used faces in cluster analysis. Chernoff faces convey the information features of multivariate data in the plane in a direct and intuitive way, and they are an important means of representing high-dimensional data in the plane. Graphical methods for multivariate data have since attracted growing attention and are used throughout multivariate analysis. The introduction of Chernoff faces was a further step in the development of graphical multivariate statistical analysis.
     The main work of this paper is as follows:
     (1) Using the properties of the V-system, Chernoff faces that contain discontinuities are reconstructed exactly. The reconstruction is compared with those obtained from the classical Fourier system and from continuous wavelets. When the Fourier system or continuous wavelets are used to reconstruct a geometric model with discontinuities, the Gibbs phenomenon is unavoidable, whereas the V-system reconstructs such a model exactly with a finite number of functions and thereby eliminates the Gibbs phenomenon.
     (2) The V-descriptor is used to quantify the features of Chernoff faces, giving an automatic, adaptive computer clustering of the faces. This avoids the misjudgments caused by subjective visual inspection in earlier Chernoff face clustering, and the advantage grows as the number of data groups increases. Tests on concrete examples show that the algorithm is simple, fast, and effective for clustering Chernoff faces, and that its results agree exactly with the clustering results produced by the SAS statistical software.
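The contrast in item (1) can be illustrated with a minimal sketch, under stated assumptions. It uses only the degree-0 case, which the abstract identifies with the Haar system; the higher-degree V-system basis used in the paper is not implemented here. The test signal (a square wave sampled at 64 points) and the function names `haar_forward`, `haar_inverse`, and `fourier_square_partial` are hypothetical choices made for this illustration, not the authors' method.

```python
import math

def haar_forward(x):
    """Orthonormal discrete Haar transform; len(x) must be a power of two."""
    s = math.sqrt(2.0)
    approx, details = list(x), []
    while len(approx) > 1:
        half = len(approx) // 2
        avg = [(approx[2 * i] + approx[2 * i + 1]) / s for i in range(half)]
        det = [(approx[2 * i] - approx[2 * i + 1]) / s for i in range(half)]
        details = det + details      # coarser levels end up in front
        approx = avg
    return approx + details          # [mean term, coarsest detail, ..., finest details]

def haar_inverse(c):
    """Inverse of haar_forward."""
    s = math.sqrt(2.0)
    approx, pos = [c[0]], 1
    while pos < len(c):
        det = c[pos:pos + len(approx)]
        pos += len(approx)
        nxt = []
        for a, d in zip(approx, det):
            nxt.append((a + d) / s)
            nxt.append((a - d) / s)
        approx = nxt
    return approx

def fourier_square_partial(t, n_harmonics):
    """Partial Fourier sum of the period-1 square wave (+1 on (0, 0.5), -1 on (0.5, 1))."""
    total = sum(math.sin(2 * math.pi * (2 * j - 1) * t) / (2 * j - 1)
                for j in range(1, n_harmonics + 1))
    return 4.0 / math.pi * total

# Assumed test signal: a square wave with one jump, sampled on 64 cells.
n = 64
signal = [1.0 if i < n // 2 else -1.0 for i in range(n)]

# Haar (degree-0 V-system): a finite number of terms reproduces the jump exactly.
coeffs = haar_forward(signal)
nonzero = sum(1 for c in coeffs if abs(c) > 1e-12)
rebuilt = haar_inverse(coeffs)
max_err = max(abs(a - b) for a, b in zip(signal, rebuilt))

# Fourier: the truncated series overshoots near the jump (Gibbs phenomenon).
fourier_peak = max(fourier_square_partial((i + 0.5) / n, 32) for i in range(n))

print("nonzero Haar coefficients:", nonzero)
print("max Haar reconstruction error:", max_err)
print("peak of 32-harmonic Fourier partial sum:", fourier_peak)
```

For this signal the Haar expansion needs very few nonzero coefficients and reconstructs the samples to rounding error. The Fourier partial sum peaks near 1.18 although the signal never exceeds 1, and adding more harmonics moves the overshoot closer to the jump without removing it.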