面向大规模日志数据的聚类算法研究_参考网

面向大规模日志数据的聚类算法研究

2012-04-29李清沈彤关毅

智能计算机与应用 2012年5期

关键词：李清标识码日志

李清沈彤关毅

摘要：针对大规模日志数据的聚类问题，提出了DBk-means算法。该算法使用Hadoop对原始日志数据进行预处理，并结合了k-means和DBSCAN聚类算法各自的优势。实验结果表明，相比k-means算法进行聚类分析，文中使用DBk-means算法进行聚类，能够取得更好的聚类效果，正确率可以达到83%以上。

关键词：

中图分类号：TP391文献标识码：A文章编号：2095-2163（2012）05-0042-04

猜你喜欢

李清标识码日志

发光的招牌

一名老党员的工作日志

Process Mineralogy of a Low Grade Ag-Pb-Zn-CaF2 Sulphide Ore and Its Implications for Mineral Processing

Study on the Degradation and Synergistic/antagonistic Antioxidizing Mechanism of Phenolic/aminic Antioxidants and Their Combinations

A Comparative Study of HER2 Detection in Gastroscopic and Surgical Specimens of Gastric Carcinoma

Significance of 18F—FDG PET / CT imaging in the evaluation of the efficacy of lymphoma

一种基于粗集和SVM的Web日志挖掘模型

智能计算机与应用

智能计算机与应用的其它文章