## Advanced Data Mining and Applications: Third International Conference, ADMA 2007, Harbin, China, August 6-8, 2007, Proceedings

The Third International Conference on Advanced Data Mining and Applications (ADMA), organized in Harbin, China, continued the tradition already established by the first two ADMA conferences in Wuhan in 2005 and Xi’an in 2006. One major goal of ADMA is to create a respectable identity in the data mining research community. This feat has been partially achieved in a very short time despite the young age of the conference, thanks to the rigorous review process insisted upon, the outstanding list of internationally renowned keynote speakers and the excellent program each year. The impact of a conference is measured by the citations the conference papers receive. Some have used this measure to rank conferences. For example, as of June 2007 the independent source cs-conference-ranking.org ranks ADMA (0.65) higher than PAKDD (0.64) and PKDD (0.62), both well-established conferences in data mining. While the ranking itself is questionable because the exact procedure is not disclosed, it is nevertheless an encouraging indicator of recognition for a very young conference such as ADMA.

### Contents

| Title | Page |
| --- | --- |
| Mining Ambiguous Data with Multi-instance Multi-label Representation | 1 |
| A Lazy Approach for Mining Frequent Patterns over High Speed Data Streams | 2 |
| Exploring Content and Linkage Structures for Searching Relevant Web Pages | 15 |
| CLBCRA-Approach for Combination of Content-Based and Link-Based Ranking in Web Search | 23 |
| Rough Sets in Hybrid Soft Computing Systems | 35 |
| Discovering Novel Multistage Attack Strategies | 45 |
| Privacy Preserving DBSCAN Algorithm for Clustering | 57 |
| A New Multilevel Algorithm Based on Particle Swarm Optimization for Bisecting Graph | 69 |
| Supervised Neighborhood Preserving Embedding | 81 |
| A k-Anonymity Clustering Method for Effective Data Privacy Preservation | 89 |
| LS-SVM with Fuzzy Preprocessing Model Based Aero Engine Data Mining Technology | 100 |
| A Coding Hierarchy Computing Based Clustering Algorithm | 110 |
| Mining Both Positive and Negative Association Rules from Frequent and Infrequent Itemsets | 122 |
| Survey of Improving Naive Bayes for Classification | 134 |
| Privacy Preserving BIRCH Algorithm for Clustering over Arbitrarily Partitioned Databases | 146 |
| Unsupervised Outlier Detection in Sensor Networks Using Aggregation Tree | 158 |
| Sifting Hierarchical Heavy Hitters Accurately from Data Streams | 170 |
| Spatial Fuzzy Clustering Using Varying Coefficients | 183 |
| Collaborative Target Classification for Image Recognition in Wireless Sensor Networks | 191 |
| Dimensionality Reduction for Mass Spectrometry Data | 203 |
| The Study of Dynamic Aggregation of Relational Attributes on Relational Data Mining | 214 |
| Learning Optimal Kernel from Distance Metric in Twin Kernel Embedding for Dimensionality Reduction and Visualization of Fingerprints | 227 |
| Efficiently Monitoring Nearest Neighbors to a Moving Object | 239 |
| A Novel Text Classification Approach Based on Enhanced Association Rule | 252 |
| Applications of the Moving Average of nᵗʰ-Order Difference Algorithm for Time Series Prediction | 264 |
| Inference of Gene Regulatory Network by Bayesian Network Using Metropolis-Hastings Algorithm | 276 |
| A Consensus Recommender for Web Users | 287 |
| Constructing Classification Rules Based on SVR and Its Derivative Characteristics | 300 |
| Hiding Sensitive Associative Classification Rule by Data Reduction | 310 |
| AOG-ags Algorithms and Applications | 323 |
| A Framework for Titled Document Categorization with Modified Multinomial Naivebayes Classifier | 335 |
| Prediction of Protein Subcellular Locations by Combining K-Local Hyperplane Distance Nearest Neighbor | 345 |
| A Similarity Retrieval Method in Brain Image Sequence Database | 352 |
| A Criterion for Learning the Data-Dependent Kernel for Classification | 365 |
| Topic Extraction with AGAPE | 377 |
| Clustering Massive Text Data Streams by Semantic Smoothing Model | 389 |
| A Novel Approximate Mining Approach of Sequential Patterns over Data Stream | 401 |
| A Novel Greedy Bayesian Network Structure Learning Algorithm for Limited Data | 412 |
| Optimum Neural Network Construction Via Linear Programming Minimum Sphere Set Covering | 422 |
| Networks | 430 |
| Prediction of Enzyme Class by Using Reactive Motifs Generated from Binding and Catalytic Sites | 442 |
| Bayesian Network Structure Ensemble Learning | 454 |
| Fusion of Palmprint and Iris for Personal Authentication | 466 |
| Enhanced Graph Based Genealogical Record Linkage | 476 |
| A Fuzzy Comprehensive Clustering Method | 488 |
| A Novel Classification Algorithm Based on Concept Similarity | 500 |
| A Retrospective Analysis | 508 |
| Chinese Patent Mining Based on Sememe Statistics and Key-Phrase Extraction | 516 |
| Classification of Business Travelers Using SVMs Combined with Kernel Principal Component Analysis | 524 |
| Research on the Traffic Matrix Based on Sampling Model | 533 |
| A Causal Analysis for the Expenditure Data of Business Travelers | 545 |
| A Visual and Interactive Data Exploration Method for Large Data Sets and Clustering | 553 |
| Explorative Data Mining on Stock Data Experimental Results and Findings | 562 |
| Graph Structural Mining in Terrorist Networks | 570 |
| Characterizing Pseudobase and Predicting RNA Secondary Structure with Simple H-Type Pseudoknots Based on Dynamic Programming | 578 |
| Locally Discriminant Projection with Kernels for Feature Extraction | 586 |
| A GA-Based Feature Subset Selection and Parameter Optimization of Support Vector Machine for Content-Based Image Retrieval | 594 |
| Evolution-Based Technique for Stream Clustering | 605 |
| A New Hierarchical Clustering Based on Bayesian Networks | 616 |
| An Improved AdaBoost Algorithm Based on Adaptive Weight Adjusting | 625 |

### Common terms and phrases

accuracy, ADMA, aggregation, Alhajj, amino acids, analysis, approach, association rules, attributes, Bayesian network, Berlin Heidelberg 2007, China, classification, clustering algorithm, clustering method, Computer, data mining, data points, data stream, database, dataset, DBSCAN, defined, denotes, detection, different, dimensional, distance, enzyme, equivalence partition, evaluation, experimental results, experiments, extraction, feature, feature extraction, first, frequent itemsets, function, fuzzy, fuzzy sets, gene regulatory network, genetic algorithms, graph, Heidelberg, IEEE, input, k-anonymity, k-medoids, kernel, KLCI, linear, Machine Learning, matrix, measure, naive Bayes, neighbor, neural network, optimization, outliers, paper, parameters, partitioned, partitioned databases, performance, pixels, prediction, problem, proposed, protein, protocol, pseudoknots, query, reactive motifs, records, rough sets, samples, Section, selected, sememe, sensor, sensor nodes, sequence, sequential patterns, similarity, space, Springer, Springer-Verlag Berlin Heidelberg, step, structure, subset, Support Vector Machines, Table, techniques, Technology, threshold, tree, tuple, update, user session, variables, Wang, weight, Zhang