By Junjie Wu
Nearly we all know K-means set of rules within the fields of information mining and enterprise intelligence. however the ever-emerging information with super advanced features convey new demanding situations to this "old" set of rules. This publication addresses those demanding situations and makes novel contributions in developing theoretical frameworks for K-means distances and K-means dependent consensus clustering, deciding upon the "dangerous" uniform influence and zero-value predicament of K-means, adapting correct measures for cluster validity, and integrating K-means with SVMs for infrequent category research. This publication not just enriches the clustering and optimization theories, but additionally offers stable tips for the sensible use of K-means, particularly for vital projects resembling community intrusion detection and credits fraud prediction. The thesis on which this booklet relies has received the "2010 nationwide very good Doctoral Dissertation Award", the top honor for no more than a hundred PhD theses in line with 12 months in China.
Read or Download Advances in K-means Clustering: A Data Mining Thinking (Springer Theses) PDF
Best data mining books
The single booklet to hide and evaluate Oracle's on-line analytic processing items With the purchase of Hyperion structures in 2007, Oracle unearths itself possessing the 2 such a lot able OLAP items at the market--Essbase and the OLAP choice to the Oracle Database. Written by way of the main a professional specialists on either Essbase and Oracle OLAP, this Oracle Press advisor explains how those items are comparable and the way they vary.
Information Mining and knowledge Visualization specializes in facing large-scale information, a box quite often often called facts mining. The booklet is split into 3 sections. the 1st bargains with an advent to statistical facets of knowledge mining and computer studying and contains functions to textual content research, laptop intrusion detection, and hiding of data in electronic documents.
This ebook unravels the secret of massive facts computing and its energy to remodel enterprise operations. The technique it makes use of could be worthwhile to any specialist who needs to current a case for figuring out colossal facts computing recommendations or to people who may be occupied with a massive information computing undertaking. It presents a framework that allows company and technical managers to make optimum judgements priceless for the winning migration to important information computing environments and purposes inside of their organisations.
The whole consultant to information technological know-how with Hadoop—For Technical pros, Businesspeople, and scholars call for is hovering for execs who can resolve genuine info technology issues of Hadoop and Spark. sensible info technology with Hadoop® and Spark is the complete advisor to doing simply that.
- Data Analysis (Digital Signal and Image Processing)
- Advanced Computer and Communication Engineering Technology: Proceedings of the 1st International Conference on Communication and Computer Engineering (Lecture Notes in Electrical Engineering)
- Shale Analytics: Data-Driven Analytics in Unconventional Resources
- Beginning SQL Server Reporting Services
- Econophysics Approaches to Large-Scale Business Data and Financial Crisis: Proceedings of Tokyo Tech-Hitotsubashi Interdisciplinary Conference + APFA7
- Apache Hive Cookbook
Extra info for Advances in K-means Clustering: A Data Mining Thinking (Springer Theses)