The imbalanced dataset problem arises in many domains, such as web page search and scam site detection. In this paper, we propose an alternative re-sampling approach to deal with imbalanced datasets. We demonstrate this approach with a concrete implementat
Mining of Massive Datasets. Anand Rajaraman (Kosmix, Inc.), Jeffrey D. Ullman (Stanford Univ.). Copyright © 2010, 2011 Anand Rajaraman and Jeffrey D. Ullman
Mining of Massive Datasets, Jure Leskovec. A classic data mining reference book, recommended by my professor when I studied abroad!! The popularity of the Web and Internet commerce provides many extremely large datasets from which information can be gleaned by data mining. This book focuses on practical algorithms th