聚类分析外文文献及翻译.doc

上传人:s9****2 文档编号:546306203 上传时间:2024-03-14 格式:DOC 页数:15 大小:739KB
返回 下载 相关 举报
聚类分析外文文献及翻译.doc_第1页
第1页 / 共15页
聚类分析外文文献及翻译.doc_第2页
第2页 / 共15页
聚类分析外文文献及翻译.doc_第3页
第3页 / 共15页
聚类分析外文文献及翻译.doc_第4页
第4页 / 共15页
聚类分析外文文献及翻译.doc_第5页
第5页 / 共15页
点击查看更多>>
资源描述

《聚类分析外文文献及翻译.doc》由会员分享,可在线阅读,更多相关《聚类分析外文文献及翻译.doc(15页珍藏版)》请在金锄头文库上搜索。

1、本科毕业论文外文文献及译文文献、资料题目: Cluster AnalysisBasic Concepts and Algorithms文献、资料来源:文献、资料发表(出版)日期:院 (部): 土木工程学院专 业: 土木工程班 级: 姓 名: 学 号: 指导教师:翻译日期: 外文文献:Cluster AnalysisBasic Concepts and AlgorithmsCluster analysis divides data into groups (clusters) that are meaningful, useful,or both. If meaningful groups ar

2、e the goal, then the clusters should capture the natural structure of the data. In some cases, however, cluster analysis is only a useful starting point for other purposes, such as data summarization. Whether for understanding or utility, cluster analysis has long played an important role in a wide

3、variety of elds: psychology and other social sciences, biology,statistics, pattern recognition, information retrieval, machine learning, and data mining.There have been many applications of cluster analysis to practical problems. We provide some specic examples, organized by whether the purpose of t

4、he clustering is understanding or utility.Clustering for Understanding Classes, or conceptually meaningful groups of objects that share common characteristics, play an important role in how people analyze and describe the world. Indeed, human beings are skilled at dividing objects into groups (clust

5、ering) and assigning particular objects to these groups (classication). For example, even relatively young children can quickly label the objects in a photograph as buildings, vehicles, people, animals, plants, etc. In the context of understanding data, clusters are potential classes and cluster ana

6、lysis is the study of techniques for automatically nding classes. The following are some examples:Biology. Biologists have spent many years creating a taxonomy (hierarchical classication) of all living things: kingdom, phylum, class,order, family, genus, and species. Thus, it is perhaps not surprisi

7、ng that much of the early work in cluster analys is sought to create a discipline of mathematical taxonomy that could automatically nd such classication structures. More recently, biologists have applied clustering to analyze the large amounts of genetic information that are now available. For examp

8、le, clustering has been used to nd groups of genes that have similar functions. Information Retrieval. The World Wide Web consists of billions of Web pages, and the results of a query to a search engine can return thousands of pages. Clustering can be used to group these search results into a small

9、number of clusters, each of which captures a particular aspect of the query. For instance, a query of “movie” might return Web pages grouped into categories such as reviews, trailers, stars, and theaters. Each category (cluster) can be broken into subcategories (sub-clusters), producing a hierarchic

10、al structure that further assists a users exploration of the query results. Climate. Understanding the Earths climate requires nding patternsin the atmosphere and ocean. To that end, cluster analysis has been applied to nd patterns in the atmospheric pressure of polar regions and areas of the ocean

11、that have a signicant impact on land climate. Psychology and Medicine. An illness or condition frequently has a number of variations, and cluster analysis can be used to identify these different subcategories. For example, clustering has been used to identify different types of depression. Cluster a

12、nalysis can also be used to detect patterns in the spatial or temporal distribution of a disease. Business. Businesses collect large amounts of information on current and potential customers. Clustering can be used to segment customers into a small number of groups for additional analysis and market

13、ing activities.Clustering for Utility:Cluster analysis provides an abstraction from individual data objects to the clusters in which those data objects reside. Additionally, some clustering techniques characterize each cluster in terms of a cluster prototype; i.e., a data object that is representati

14、ve of the other objects in the cluster. These cluster prototypes can be used as the basis for a number of data analysis or data processing techniques. Therefore, in the context of utility, cluster analysis is the study of techniques for nding the most representative cluster prototypes. Summarization

15、. Many data analysis techniques, such as regression or PCA, have a time or space complexity of O(m2) or higher (where m is the number of objects), and thus, are not practical for large data sets. However, instead of applying the algorithm to the entire data set, it can be applied to a reduced data s

16、et consisting only of cluster prototypes. Depending on the type of analysis, the number of prototypes, and the accuracy with which the prototypes represent the data, the results can be comparable to those that would have been obtained if all the data could have been used. Compression. Cluster prototypes can also be used for data compres-sion. In particular, a table is created that consists of

展开阅读全文
相关资源
相关搜索

当前位置:首页 > 商业/管理/HR > 其它文档 > 租房合同

电脑版 |金锄头文库版权所有
经营许可证:蜀ICP备13022795号 | 川公网安备 51140202000112号