Skip to content

Latest commit

 

History

History
201 lines (200 loc) · 58.9 KB

File metadata and controls

201 lines (200 loc) · 58.9 KB

KDD2013 Paper List

论文 作者 摘要 翻译 代码 引用数
Auto-WEKA: combined selection and hyperparameter optimization of classification algorithms Chris Thornton, Frank Hutter, Holger H. Hoos, Kevin LeytonBrown code 623
U-Air: when urban air quality inference meets big data Yu Zheng, Furui Liu, HsunPing Hsieh code 527
Ad click prediction: a view from the trenches H. Brendan McMahan, Gary Holt, David Sculley, Michael Young, Dietmar Ebner, Julian Grady, Lan Nie, Todd Phillips, Eugene Davydov, Daniel Golovin, Sharat Chikkerur, Dan Liu, Martin Wattenberg, Arnar Mar Hrafnkelsson, Tom Boulos, Jeremy Kubica code 471
FISM: factored item similarity models for top-N recommender systems Santosh Kabbur, Xia Ning, George Karypis code 381
Learning geographical preferences for point-of-interest recommendation Bin Liu, Yanjie Fu, Zijun Yao, Hui Xiong code 281
Connecting users across social media sites: a behavioral-modeling approach Reza Zafarani, Huan Liu code 261
LCARS: a location-content-aware recommender system Hongzhi Yin, Yizhou Sun, Bin Cui, Zhiting Hu, Ling Chen code 251
Why people hate your app: making sense of user feedback in a mobile app store Bin Fu, Jialiu Lin, Lei Li, Christos Faloutsos, Jason I. Hong, Norman M. Sadeh code 246
Spotting opinion spammers using behavioral footprints Arjun Mukherjee, Abhinav Kumar, Bing Liu, Junhui Wang, Meichun Hsu, Malú Castellanos, Riddhiman Ghosh code 212
Online controlled experiments at large scale Ron Kohavi, Alex Deng, Brian Frasca, Toby Walker, Ya Xu, Nils Pohlmann code 206
Collaborative matrix factorization with multiple similarities for predicting drug-target interactions Xiaodong Zheng, Hao Ding, Hiroshi Mamitsuka, Shanfeng Zhu code 177
Denser than the densest subgraph: extracting optimal quasi-cliques with quality guarantees Charalampos E. Tsourakakis, Francesco Bonchi, Aristides Gionis, Francesco Gullo, Maria A. Tsiarli code 176
Geo-spotting: mining online location-based services for optimal retail store placement Dmytro Karamshuk, Anastasios Noulas, Salvatore Scellato, Vincenzo Nicosia, Cecilia Mascolo code 167
Who, where, when and what: discover spatio-temporal topics for twitter users Quan Yuan, Gao Cong, Zongyang Ma, Aixin Sun, Nadia MagnenatThalmann code 161
Accurate intelligible models with pairwise interactions Yin Lou, Rich Caruana, Johannes Gehrke, Giles Hooker code 146
TurboGraph: a fast parallel graph engine handling billion-scale graphs in a single PC WookShin Han, Sangyeon Lee, Kyungyeol Park, JeongHoon Lee, MinSoo Kim, Jinha Kim, Hwanjo Yu code 146
Fast and scalable polynomial kernels via explicit feature maps Ninh Pham, Rasmus Pagh code 135
Real-time disease surveillance using Twitter data: demonstration on flu and cancer Kathy Lee, Ankit Agrawal, Alok N. Choudhary code 117
Big data analytics for healthcare Jimeng Sun, Chandan K. Reddy code 117
Simple and deterministic matrix sketching Edo Liberty code 112
Combining latent factor model with location features for event-based group recommendation Wei Zhang, Jianyong Wang, Wei Feng code 108
Subsampling for efficient and effective unsupervised outlier detection ensembles Arthur Zimek, Matthew Gaudet, Ricardo J. G. B. Campello, Jörg Sander code 107
Discriminant malware distance learning on structural information for automated malware classification Deguang Kong, Guanhua Yan code 100
The role of information diffusion in the evolution of social networks Lilian Weng, Jacob Ratkiewicz, Nicola Perra, Bruno Gonçalves, Carlos Castillo, Francesco Bonchi, Rossano Schifanella, Filippo Menczer, Alessandro Flammini code 96
Privacy-preserving data exploration in genome-wide association studies Aaron Johnson, Vitaly Shmatikov code 90
Cascading outbreak prediction in networks: a data-driven approach Peng Cui, Shifei Jin, Linyun Yu, Fei Wang, Wenwu Zhu, Shiqiang Yang code 88
Location-aware publish/subscribe Guoliang Li, Yang Wang, Ting Wang, Jianhua Feng code 85
Linking named entities in Tweets with knowledge base via user interest modeling Wei Shen, Jianyong Wang, Ping Luo, Min Wang code 84
On the equivalent of low-rank linear regressions and linear discriminant analysis based regressions Xiao Cai, Chris H. Q. Ding, Feiping Nie, Heng Huang code 82
Graph cluster randomization: network exposure to multiple universes Johan Ugander, Brian Karrer, Lars Backstrom, Jon M. Kleinberg code 80
Social influence based clustering of heterogeneous information networks Yang Zhou, Ling Liu code 75
Confluence: conformity influence in large social networks Jie Tang, Sen Wu, Jimeng Sun code 75
Discovering latent influence in online social activities via shared cascade poisson processes Tomoharu Iwata, Amar Shah, Zoubin Ghahramani code 74
Cost-sensitive online active learning with application to malicious URL detection Peilin Zhao, Steven C. H. Hoi code 73
Restreaming graph partitioning: simple versatile algorithms for advanced balancing Joel Nishimura, Johan Ugander code 70
A space efficient streaming algorithm for triangle counting using the birthday paradox Madhav Jha, C. Seshadhri, Ali Pinar code 68
SIGMa: simple greedy matching for aligning large knowledge bases Simon LacosteJulien, Konstantina Palla, Alex Davies, Gjergji Kasneci, Thore Graepel, Zoubin Ghahramani code 65
Modeling and probabilistic reasoning of population evacuation during large-scale disaster Xuan Song, Quanshi Zhang, Yoshihide Sekimoto, Teerayut Horanont, Satoshi Ueyama, Ryosuke Shibasaki code 62
Stochastic collapsed variational Bayesian inference for latent Dirichlet allocation James R. Foulds, Levi Boyles, Christopher DuBois, Padhraic Smyth, Max Welling code 60
Detecting insider threats in a real corporate database of computer usage activity Ted E. Senator, Henry G. Goldberg, Alex Memory, William T. Young, Brad Rees, Robert Pierce, Daniel Huang, Matthew Reardon, David A. Bader, Edmond Chow, Irfan A. Essa, Joshua Jones, Vinay Bettadapura, Duen Horng Chau, Oded Green, Oguz Kaya, Anita Zakrzewska, Erica Briscoe, Rudolph L. Mappus IV, Robert McColl, Lora Weiss, Thomas G. Dietterich, Alan Fern, WengKeen Wong, Shubhomoy Das, Andrew Emmott, Jed Irvine, Jay Yoon Lee, Danai Koutra, Christos Faloutsos, Daniel D. Corkill, Lisa Friedland, Amanda Gentzel, David D. Jensen code 60
Recursive regularization for large-scale classification with hierarchical and graphical dependencies Siddharth Gopal, Yiming Yang code 59
Knowledge discovery from massive healthcare claims data Varun Chandola, Sreenivas R. Sukumar, Jack C. Schryver code 59
DTW-D: time series semi-supervised learning from a single example Yanping Chen, Bing Hu, Eamonn J. Keogh, Gustavo E. A. P. A. Batista code 57
Mining evidences for named entity disambiguation Yang Li, Chi Wang, Fangqiu Han, Jiawei Han, Dan Roth, Xifeng Yan code 57
Information cartography: creating zoomable, large-scale maps of information Dafna Shahaf, Jaewon Yang, Caroline Suen, Jeff Jacobs, Heidi Wang, Jure Leskovec code 56
Entity resolution for big data Lise Getoor, Ashwin Machanavajjhala code 56
Mining high utility episodes in complex event sequences ChengWei Wu, YuFeng Lin, Philip S. Yu, Vincent S. Tseng code 55
Mining frequent graph patterns with differential privacy Entong Shen, Ting Yu code 53
Assessing team strategy using spatiotemporal data Patrick Lucey, Dean Oliver, Peter Carr, Joe Roth, Iain A. Matthews code 53
Statistical quality estimation for general crowdsourcing tasks Yukino Baba, Hisashi Kashima code 51
Model-based kernel for efficient time series analysis Huanhuan Chen, Fengzhen Tang, Peter Tiño, Xin Yao code 51
A phrase mining framework for recursive construction of a topical hierarchy Chi Wang, Marina Danilevsky, Nihit Desai, Yinan Zhang, Phuong Nguyen, Thrivikrama Taula, Jiawei Han code 51
Evaluating the crowd with confidence Manas Joglekar, Hector GarciaMolina, Aditya G. Parameswaran code 50
Inferring social roles and statuses in social networks Yuchen Zhao, Guan Wang, Philip S. Yu, Shaobo Liu, Simon Zhang code 50
The bang for the buck: fair competitive viral marketing from the host perspective Wei Lu, Francesco Bonchi, Amit Goyal, Laks V. S. Lakshmanan code 49
Multi-label classification by mining label and instance correlations from heterogeneous information networks Xiangnan Kong, Bokai Cao, Philip S. Yu code 48
Robust principal component analysis via capped norms Qian Sun, Shuo Xiang, Jieping Ye code 46
Understanding Twitter data with TweetXplorer Fred Morstatter, Shamanth Kumar, Huan Liu, Ross Maciejewski code 45
Network discovery via constrained tensor analysis of fMRI data Ian N. Davidson, Sean Gilpin, Owen T. Carmichael, Peter B. Walker code 45
Flexible and robust co-regularized multi-domain graph clustering Wei Cheng, Xiang Zhang, Zhishan Guo, Yubao Wu, Patrick F. Sullivan, Wei Wang code 44
Making recommendations from multiple domains Wei Chen, Wynne Hsu, MongLi Lee code 43
Maximizing acceptance probability for active friending in online social networks DeNian Yang, HuiJu Hung, WangChien Lee, Wei Chen code 43
Silence is also evidence: interpreting dwell time for recommendation from psychological perspective Peifeng Yin, Ping Luo, WangChien Lee, Min Wang code 42
Redundancy-aware maximal cliques Jia Wang, James Cheng, Ada WaiChee Fu code 42
Comparing apples to oranges: a scalable solution with heterogeneous hashing Mingdong Ou, Peng Cui, Fei Wang, Jun Wang, Wenwu Zhu, Shiqiang Yang code 41
Multi-label relational neighbor classification using social context features Xi Wang, Gita Sukthankar code 41
Trace complexity of network inference Bruno D. Abrahao, Flavio Chierichetti, Robert Kleinberg, Alessandro Panconesi code 41
STED: semi-supervised targeted-interest event detectionin in twitter Ting Hua, Feng Chen, Liang Zhao, ChangTien Lu, Naren Ramakrishnan code 40
STRIP: stream learning of influence probabilities Konstantin Kutzkov, Albert Bifet, Francesco Bonchi, Aristides Gionis code 40
Fast rank-2 nonnegative matrix factorization for hierarchical document clustering Da Kuang, Haesun Park code 39
A new collaborative filtering approach for increasing the aggregate diversity of recommender systems Katja Niemann, Martin Wolpers code 38
Guided learning for role discovery (GLRD): framework, algorithms, and applications Sean Gilpin, Tina EliassiRad, Ian N. Davidson code 38
Scalable all-pairs similarity search in metric spaces Ye Wang, Ahmed Metwally, Srinivasan Parthasarathy code 37
Diversity maximization under matroid constraints Zeinab Abbassi, Vahab S. Mirrokni, Mayur Thakur code 37
Synthetic review spamming and defense Huan Sun, Alex Morales, Xifeng Yan code 37
An efficient ADMM algorithm for multidimensional anisotropic total variation regularization problems Sen Yang, Jie Wang, Wei Fan, Xiatian Zhang, Peter Wonka, Jieping Ye code 35
One theme in all views: modeling consensus topics in multiple contexts Jian Tang, Ming Zhang, Qiaozhu Mei code 35
Multi-source learning with block-wise missing data for Alzheimer's disease prediction Shuo Xiang, Lei Yuan, Wei Fan, Yalin Wang, Paul M. Thompson, Jieping Ye code 34
Unsupervised link prediction using aggregative statistics on heterogeneous social networks TsungTing Kuo, Rui Yan, YuYang Huang, PerngHwa Kung, ShouDe Lin code 34
Scalable text and link analysis with mixed-topic link models Yaojia Zhu, Xiaoran Yan, Lise Getoor, Cristopher Moore code 33
Uncertainty in online experiments with dependent data: an evaluation of bootstrap methods Eytan Bakshy, Dean Eckles code 32
Cross-task crowdsourcing Kaixiang Mo, Erheng Zhong, Qiang Yang code 32
Understanding evolution of research themes: a probabilistic generative model for citations Xiaolong Wang, Chengxiang Zhai, Dan Roth code 30
Debiasing social wisdom Abhimanyu Das, Sreenivas Gollapudi, Rina Panigrahy, Mahyar Salek code 30
Multi-source deep learning for information trustworthiness estimation Liang Ge, Jing Gao, Xiaoyi Li, Aidong Zhang code 30
Big data analytics with small footprint: squaring the cloud John F. Canny, Huasha Zhao code 30
Adaptive collective routing using gaussian process dynamic congestion models Siyuan Liu, Yisong Yue, Ramayya Krishnan code 28
A "semi-lazy" approach to probabilistic path prediction Jingbo Zhou, Anthony K. H. Tung, Wei Wu, Wee Siong Ng code 28
Text-based measures of document diversity Kevin Bache, David Newman, Padhraic Smyth code 27
Forex-foreteller: currency trend modeling using news articles Fang Jin, Nathan Self, Parang Saraf, Patrick Butler, Wei Wang, Naren Ramakrishnan code 25
Querying discriminative and representative samples for batch mode active learning Zheng Wang, Jieping Ye code 24
Information cascade at group scale Milad Eftekhar, Yashar Ganjali, Nick Koudas code 24
JobMiner: a real-time system for mining job-related patterns from social media Yu Cheng, Yusheng Xie, Zhengzhang Chen, Ankit Agrawal, Alok N. Choudhary, Songtao Guo code 24
Gaussian multiple instance learning approach for mapping the slums of the world using very high resolution imagery Ranga Raju Vatsavai code 22
Active learning and search on low-rank matrices Danica J. Sutherland, Barnabás Póczos, Jeff G. Schneider code 21
WiseMarket: a new paradigm for managing wisdom of online social users Caleb Chen Cao, Yongxin Tong, Lei Chen, H. V. Jagadish code 21
Extracting social events for learning better information diffusion models Shuyang Lin, Fengjiao Wang, Qingbo Hu, Philip S. Yu code 21
Efficient single-source shortest path and distance queries on large graphs Andy Diwen Zhu, Xiaokui Xiao, Sibo Wang, Wenqing Lin code 21
An integrated framework for optimizing automatic monitoring systems in large IT infrastructures Liang Tang, Tao Li, Larisa Shwartz, Florian Pinel, Genady Grabarnik code 21
Psychological advertising: exploring user psychology for click prediction in sponsored search Taifeng Wang, Jiang Bian, Shusen Liu, Yuyu Zhang, TieYan Liu code 20
Towards never-ending learning from time series streams Yuan Hao, Yanping Chen, Jesin Zakaria, Bing Hu, Thanawin Rakthanmanon, Eamonn J. Keogh code 20
An integrated framework for suicide risk prediction Truyen Tran, Dinh Q. Phung, Wei Luo, Richard Harvey, Michael Berk, Svetha Venkatesh code 20
A tool for collecting provenance data in social media Pritam Gundecha, Suhas Ranganath, Zhuo Feng, Huan Liu code 20
Selective sampling on graphs for classification Quanquan Gu, Charu C. Aggarwal, Jialu Liu, Jiawei Han code 19
On community detection in real-world networks and the importance of degree assortativity Marek Ciglan, Michal Laclavik, Kjetil Nørvåg code 19
FIU-Miner: a fast, integrated, and user-friendly system for data mining in distributed environment Chunqiu Zeng, Yexi Jiang, Li Zheng, Jingxuan Li, Lei Li, Hongtai Li, Chao Shen, Wubai Zhou, Tao Li, Bing Duan, Ming Lei, Pengnian Wang code 18
Mining discriminative subgraphs from global-state networks Sayan Ranu, Minh X. Hoang, Ambuj K. Singh code 18
Multi-space probabilistic sequence modeling Shuo Chen, Jiexun Xu, Thorsten Joachims code 18
Active search on graphs Xuezhi Wang, Roman Garnett, Jeff G. Schneider code 17
Automatic selection of social media responses to news Tadej Stajner, Bart Thomee, AnaMaria Popescu, Marco Pennacchiotti, Alejandro Jaimes code 17
Approximate graph mining with label costs Pranay Anchuri, Mohammed J. Zaki, Omer Barkol, Shahar Golan, Moshe Shamy code 16
Using co-visitation networks for detecting large scale online display advertising exchange fraud Ori Stitelman, Claudia Perlich, Brian Dalessandro, Rod Hook, Troy Raeder, Foster J. Provost code 16
A transfer learning based framework of crowd-selection on twitter Zhou Zhao, Da Yan, Wilfred Ng, Shi Gao code 16
Heat pump detection from coarse grained smart meter data with positive and unlabeled learning Hongliang Fei, Younghun Kim, Sambit Sahu, Milind R. Naphade, Sanjay K. Mamidipalli, John Hutchinson code 16
FeaFiner: biomarker identification from medical data through feature generalization and selection Jiayu Zhou, Zhaosong Lu, Jimeng Sun, Lei Yuan, Fei Wang, Jieping Ye code 15
A unified search federation system based on online user feedback Luo Jie, Sudarshan Lamkhede, Rochit Sapra, Evans Hsu, Helen Song, Yi Chang code 14
Direct optimization of ranking measures for learning to rank models Ming Tan, Tian Xia, Lily Guo, Shaojun Wang code 14
Predictive model performance: offline and online evaluations Jeonghee Yi, Ye Chen, Jie Li, Swaraj Sett, Tak W. Yan code 14
SVMpAUCtight: a new support vector method for optimizing partial AUC based on a tight convex upper bound Harikrishna Narasimhan, Shivani Agarwal code 14
Summarizing probabilistic frequent patterns: a fast approach Chunyang Liu, Ling Chen, Chengqi Zhang code 14
Link prediction with social vector clocks Conrad Lee, Bobo Nick, Ulrik Brandes, Pádraig Cunningham code 14
Empirical bayes model to combine signals of adverse drug reactions Rave Harpaz, William DuMouchel, Paea LePendu, Nigam H. Shah code 14
Density-based logistic regression Wenlin Chen, Yixin Chen, Yi Mao, Baolong Guo code 13
Massively parallel expectation maximization using graphics processing units Muzaffer Can Altinigneli, Claudia Plant, Christian Böhm code 13
Modeling the dynamics of composite social networks Erheng Zhong, Wei Fan, Yin Zhu, Qiang Yang code 13
Learning to question: leveraging user preferences for shopping advice Mahashweta Das, Gianmarco De Francisci Morales, Aristides Gionis, Ingmar Weber code 12
Mining evolutionary multi-branch trees from text streams Xiting Wang, Shixia Liu, Yangqiu Song, Baining Guo code 12
MI2LS: multi-instance learning from multiple informationsources Dan Zhang, Jingrui He, Richard D. Lawrence code 12
The business impact of deep learning Jeremy Howard code 12
Towards long-lead forecasting of extreme flood events: a data mining framework for precipitation cluster precursors identification Dawei Wang, Wei Ding, Kui Yu, Xindong Wu, Ping Chen, David L. Small, Shafiqul Islam code 12
KeySee: supporting keyword search on evolving events in social streams Pei Lee, Laks V. S. Lakshmanan, Evangelos E. Milios code 11
Collaborative boosting for activity classification in microblogs Yangqiu Song, Zhengdong Lu, Cane Wingki Leung, Qiang Yang code 11
Learning mixed kronecker product graph models with simulated method of moments Sebastián Moreno, Jennifer Neville, Sergey Kirshner code 11
iHR: an online recruiting system for Xiamen Talent Service Center Wenxing Hong, Lei Li, Tao Li, Wenfu Pan code 11
A time-dependent enhanced support vector machine for time series regression Goce Ristanoski, Wei Liu, James Bailey code 11
A general bootstrap performance diagnostic Ariel Kleiner, Ameet Talwalkar, Sameer Agarwal, Ion Stoica, Michael I. Jordan code 10
LAICOS: an open source platform for personalized social web search Mohamed Reda Bouadjenek, Hakim Hacid, Mokrane Bouzeghoub code 9
Measuring spontaneous devaluations in user preferences Komal Kapoor, Nisheeth Srivastava, Jaideep Srivastava, Paul R. Schrater code 9
Scalable inference in max-margin topic models Jun Zhu, Xun Zheng, Li Zhou, Bo Zhang code 9
Improving quality control by early prediction of manufacturing outcomes Sholom M. Weiss, Amit Dhurandhar, Robert J. Baseman code 9
Palette power: enabling visual search through colors Anurag Bhardwaj, Atish Das Sarma, Wei Di, Raffay Hamid, Robinson Piramuthu, Neel Sundaresan code 8
EventCube: multi-dimensional search and mining of structured and text data Fangbo Tao, Kin Hou Lei, Jiawei Han, Chengxiang Zhai, Xiao Cheng, Marina Danilevsky, Nihit Desai, Bolin Ding, Jing Ge, Heng Ji, Rucha Kanade, Anne Kao, Qi Li, Yanen Li, Cindy Xide Lin, Jialu Liu, Nikunj C. Oza, Ashok N. Srivastava, Rodney Tjoelker, Chi Wang, Duo Zhang, Bo Zhao code 8
Amplifying the voice of youth in Africa via text analytics Prem Melville, Vijil Chenthamarakshan, Richard D. Lawrence, James Powell, Moses Mugisha, Sharad Sapra, Rajesh Anandan, Solomon Assefa code 8
Inferring distant-time location in low-sampling-rate trajectories MengFen Chiang, YungHsiang Lin, WenChih Peng, Philip S. Yu code 7
AMETHYST: a system for mining and exploring topical hierarchies of heterogeneous data Marina Danilevsky, Chi Wang, Fangbo Tao, Son Nguyen, Gong Chen, Nihit Desai, Lidan Wang, Jiawei Han code 7
Representing documents through their readers Khalid ElArini, Min Xu, Emily B. Fox, Carlos Guestrin code 7
Risk-O-Meter: an intelligent clinical risk calculator Kiyana Zolfaghar, Jayshree Agarwal, Deepthi Sistla, SiChi Chin, Senjuti Basu Roy, Nele Verbiest code 7
Query clustering based on bid landscape for sponsored search auction optimization Ye Chen, Weiguo Liu, Jeonghee Yi, Anton Schwaighofer, Tak W. Yan code 6
Dynamic memory allocation policies for postings in real-time Twitter search Nima Asadi, Jimmy Lin, Michael Busch code 6
Fast structure learning in generalized stochastic processes with latent factors Mohammad Taha Bahadori, Yan Liu, Eric P. Xing code 6
SEA: a system for event analysis on chinese tweets Yaqiong Wang, Hongfu Liu, Hao Lin, Junjie Wu, Zhiang Wu, Jie Cao code 6
Network sampling Lise Getoor, Ashwin Machanavajjhala code 5
Optimizing parallel belief propagation in junction treesusing regression Lu Zheng, Ole J. Mengshoel code 5
Cyber security: how visual analytics unlock insight Raffael Marty code 5
Analysis of advanced meter infrastructure data of water consumption in apartment buildings Einat Kermany, Hanna Mazzawi, Dorit Baras, Yehuda Naveh, Hagai Michaelis code 5
Mining for geographically disperse communities in social networks by leveraging distance modularity Paulo Shakarian, Patrick Roos, Devon Callahan, Cory Kirk code 5
Estimating sharer reputation via social data calibration Jaewon Yang, BeeChung Chen, Deepak Agarwal code 4
Constrained stochastic gradient descent for large-scale least squares problem Yang Mu, Wei Ding, Tianyi Zhou, Dacheng Tao code 4
Financing lead triggers: empowering sales reps through knowledge discovery and fusion Kareem S. Aggour, Bethany Hoogs code 4
Exploratory analysis of highly heterogeneous document collections Arun S. Maiya, John P. Thompson, Francisco LoaizaLemos, Robert M. Rolfe code 4
A data-driven method for in-game decision making in MLB: when to pull a starting pitcher Gartheeban Ganeshapillai, John V. Guttag code 4
Mining data from mobile devices: a survey of smart sensing and analytics Spiros Papadimitriou, Tina EliassiRad code 4
Robust sparse estimation of multiresponse regression and inverse covariance matrix via the L2 distance Aurélie C. Lozano, Huijing Jiang, Xinwei Deng code 3
Beyond myopic inference in big data pipelines Karthik Raman, Adith Swaminathan, Johannes Gehrke, Thorsten Joachims code 3
Model selection in markovian processes Assaf Hallak, Dotan Di Castro, Shie Mannor code 3
Mining lines in the sand: on trajectory discovery from untrustworthy data in cyber-physical system LuAn Tang, Xiao Yu, Quanquan Gu, Jiawei Han, Alice Leung, Thomas La Porta code 3
Nonparametric hierarchal bayesian modeling in non-contractual heterogeneous survival data Shouichi Nagano, Yusuke Ichikawa, Noriko Takaya, Tadasu Uchiyama, Makoto Abe code 3
Quadratic optimization to identify highly heritable quantitative traits from complex phenotypic features Jiangwen Sun, Jinbo Bi, Henry R. Kranzler code 3
Adaptive adversaries: building systems to fight fraud and cyber intruders Ari Gesher code 3
Hadoop: a view from the trenches Milind Bhandarkar code 3
Scalable supervised dimensionality reduction using clustering Troy Raeder, Claudia Perlich, Brian Dalessandro, Ori Stitelman, Foster J. Provost code 3
Experience from hosting a corporate prediction market: benefits beyond the forecasts Thomas A. Montgomery, Paul M. Stieg, Michael J. Cavaretta, Paul E. Moraal code 3
Exploiting user clicks for automatic seed set generation for entity matching Xiao Bai, Flavio Paiva Junqueira, Srinivasan H. Sengamedu code 2
Speeding up large-scale learning with a social prior Deepayan Chakrabarti, Ralf Herbrich code 2
LAFT-Explorer: inferring, visualizing and predicting how your social network expands Jun Zhang, Chaokun Wang, Yuanchi Ning, Yichi Liu, Jianmin Wang, Philip S. Yu code 2
Trial and error in influential social networks Xiaohui Bei, Ning Chen, Liyu Dou, Xiangru Huang, Ruixin Qiang code 2
Succinct interval-splitting tree for scalable similarity search of compound-protein pairs with property constraints Yasuo Tabei, Akihiro Kishimoto, Masaaki Kotera, Yoshihiro Yamanishi code 1
The online revolution: education for everyone Andrew Y. Ng, Daphne Koller code 1
Indexed block coordinate descent for large-scale linear classification with limited memory Ian EnHsu Yen, ChunFu Chang, TingWei Lin, ShanWei Lin, ShouDe Lin code 1
Exact sparse recovery with L0 projections Ping Li, CunHui Zhang code 1
Targeting and influencing at scale: from presidential elections to social good Rayid Ghani code 1
A data mining driven risk profiling method for road asset management Daniel Emerson, Justin Weligamage, Richi Nayak code 1
Efficiently rewriting large multimedia application execution traces with few event sequences Christiane Kamdem Kengne, Léon Constantin Fopa, Alexandre Termier, Noha Ibrahim, MarieChristine Rousset, Takashi Washio, Miguel Santana code 1
A privacy preserving framework for managing vehicle data in road pricing systems Huayu Wu, Wee Siong Ng, KianLee Tan, Wei Wu, Shili Xiang, Mingqiang Xue code 1
Algorithmic techniques for modeling and mining large graphs (AMAzING) Alan M. Frieze, Aristides Gionis, Charalampos E. Tsourakakis code 1
Predicting the present with search engine data Hal R. Varian code 0
Mining the digital universe of data to develop personalized cancer therapies Eric E. Schadt code 0
An online system with end-user services: mining novelty concepts from tv broadcast subtitles Mika Rautiainen, Jouni Sarvanko, Arto Heikkinen, Mika Ylianttila, Vassilis Kostakos code 0
When TEDDY meets GrizzLY: temporal dependency discovery for triggering road deicing operations Céline Robardet, VasileMarian Scuturici, Marc Plantevit, Antoine Fraboulet code 0
Scale-out beyond map-reduce Raghu Ramakrishnan code 0
Optimization in learning and data analysis Stephen J. Wright code 0
Repetition-aware content placement in navigational networks Dóra Erdös, Vatche Ishakian, Azer Bestavros, Evimaria Terzi code 0
To buy or not to buy: that is the question Oren Etzioni code 0
Using "big data" to solve "small data" problems Chris Neumann code 0
Panel: a data scientist's guide to making money from start-ups Foster J. Provost, Geoffrey I. Webb code 0
SAE: social analytic engine for large networks Yang Yang, Jianfei Wang, Yutao Zhang, Wei Chen, Jing Zhang, Honglei Zhuang, Zhilin Yang, Bo Ma, Zhanpeng Fang, Sen Wu, Xiaoxiao Li, Debing Liu, Jie Tang code 0
The dataminer's guide to scalable mixed-membership and nonparametric bayesian models Amr Ahmed, Alexander J. Smola code 0