Skip to content

Latest commit

 

History

History
240 lines (239 loc) · 70.5 KB

File metadata and controls

240 lines (239 loc) · 70.5 KB

KDD2016 Paper List

论文 作者 摘要 翻译 代码 引用数
XGBoost: A Scalable Tree Boosting System Tianqi Chen, Carlos Guestrin code 12882
node2vec: Scalable Feature Learning for Networks Aditya Grover, Jure Leskovec code 4647
"Why Should I Trust You?": Explaining the Predictions of Any Classifier Marco Túlio Ribeiro, Sameer Singh, Carlos Guestrin code 4561
Structural Deep Network Embedding Daixin Wang, Peng Cui, Wenwu Zhu code 1374
Collaborative Knowledge Base Embedding for Recommender Systems Fuzheng Zhang, Nicholas Jing Yuan, Defu Lian, Xing Xie, WeiYing Ma code 717
Asymmetric Transitivity Preserving Graph Embedding Mingdong Ou, Peng Cui, Jian Pei, Ziwei Zhang, Wenwu Zhu code 570
Interpretable Decision Sets: A Joint Framework for Description and Prediction Himabindu Lakkaraju, Stephen H. Bach, Jure Leskovec code 256
Recurrent Marked Temporal Point Processes: Embedding Event History to Vector Nan Du, Hanjun Dai, Rakshit Trivedi, Utkarsh Upadhyay, Manuel GomezRodriguez, Le Song code 232
Convolutional Neural Networks for Steady Flow Approximation Xiaoxiao Guo, Wei Li, Francesco Iorio code 222
Multi-layer Representation Learning for Medical Concepts Edward Choi, Mohammad Taha Bahadori, Elizabeth Searles, Catherine Coffey, Michael Thompson, James Bost, Javier TejedorSojo, Jimeng Sun code 218
Deep Crossing: Web-Scale Modeling without Manually Crafted Combinatorial Features Ying Shan, T. Ryan Hoens, Jian Jiao, Haijing Wang, Dong Yu, J. C. Mao code 203
CNTK: Microsoft's Open-Source Deep-Learning Toolkit Frank Seide, Amit Agarwal code 203
Algorithmic Bias: From Discrimination Discovery to Fairness-aware Data Mining Sara Hajian, Francesco Bonchi, Carlos Castillo code 192
Towards Conversational Recommender Systems Konstantina Christakopoulou, Filip Radlinski, Katja Hofmann code 166
Point-of-Interest Recommendations: Learning Potential Check-ins from Friends Huayu Li, Yong Ge, Richang Hong, Hengshu Zhu code 153
Deep Visual-Semantic Hashing for Cross-Modal Retrieval Yue Cao, Mingsheng Long, Jianmin Wang, Qiang Yang, Philip S. Yu code 153
FRAUDAR: Bounding Graph Fraud in the Face of Camouflage Bryan Hooi, Hyun Ah Song, Alex Beutel, Neil Shah, Kijung Shin, Christos Faloutsos code 147
Rebalancing Bike Sharing Systems: A Multi-source Data Smart Optimization Junming Liu, Leilei Sun, Weiwei Chen, Hui Xiong code 125
FINAL: Fast Attributed Network Alignment Si Zhang, Hanghang Tong code 120
Extreme Multi-label Loss Functions for Recommendation, Tagging, Ranking & Other Missing Label Applications Himanshu Jain, Yashoteja Prabhu, Manik Varma code 111
Smart Reply: Automated Response Suggestion for Email Anjuli Kannan, Karol Kurach, Sujith Ravi, Tobias Kaufmann, Andrew Tomkins, Balint Miklos, Greg Corrado, László Lukács, Marina Ganea, Peter Young, Vivek Ramavajjala code 111
Topic Modeling of Short Texts: A Pseudo-Document View Yuan Zuo, Junjie Wu, Hui Zhang, Hao Lin, Fei Wang, Ke Xu, Hui Xiong code 109
GMove: Group-Level Mobility Modeling Using Geo-Tagged Social Media Chao Zhang, Keyang Zhang, Quan Yuan, Luming Zhang, Tim Hanratty, Jiawei Han code 108
Meta Structure: Computing Relevance in Large Heterogeneous Information Networks Zhipeng Huang, Yudian Zheng, Reynold Cheng, Yizhou Sun, Nikos Mamoulis, Xiang Li code 107
Latent Space Model for Road Networks to Predict Time-Varying Traffic Dingxiong Deng, Cyrus Shahabi, Ugur Demiryurek, Linhong Zhu, Rose Yu, Yan Liu code 105
Crime Rate Inference with Big Data Hongjian Wang, Daniel Kifer, Corina Graif, Zhenhui Li code 96
User Identity Linkage by Latent User Space Modelling Xin Mu, Feida Zhu, EePeng Lim, Jing Xiao, Jianzong Wang, ZhiHua Zhou code 95
Predicting Disk Replacement towards Reliable Data Centers Mirela Madalina Botezatu, Ioana Giurgiu, Jasmina Bogojeska, Dorothea Wiesmann code 89
Fast Memory-efficient Anomaly Detection in Streaming Heterogeneous Graphs Emaad A. Manzoor, Sadegh M. Milajerdi, Leman Akoglu code 75
Robust Influence Maximization Wei Chen, Tian Lin, Zihan Tan, Mingfei Zhao, Xuren Zhou code 74
Extracting Optimal Performance from Dynamic Time Warping Abdullah Mueen, Eamonn J. Keogh code 70
Unified Point-of-Interest Recommendation with Temporal Interval Assessment Yanchi Liu, Chuanren Liu, Bin Liu, Meng Qu, Hui Xiong code 66
Anomaly Detection Using Program Control Flow Graph Mining From Execution Logs Animesh Nandi, Atri Mandal, Shubham Atreja, Gargi Banerjee Dasgupta, Subhrajit Bhattacharya code 65
Ranking Relevance in Yahoo Search Dawei Yin, Yuening Hu, Jiliang Tang, Tim Daly Jr., Mianwei Zhou, Hua Ouyang, Jianhui Chen, Changsung Kang, Hongbo Deng, Chikashi Nobata, JeanMarc Langlois, Yi Chang code 64
DeepIntent: Learning Attentions for Online Advertising with Recurrent Neural Networks Shuangfei Zhai, Kenghao Chang, Ruofei Zhang, Zhongfei (Mark) Zhang code 64
Online Context-Aware Recommendation with Time Varying Multi-Armed Bandit Chunqiu Zeng, Qing Wang, Shekoofeh Mokhtari, Tao Li code 63
A Multi-Task Learning Formulation for Survival Analysis Yan Li, Jie Wang, Jieping Ye, Chandan K. Reddy code 62
Dynamic Clustering of Streaming Short Documents Shangsong Liang, Emine Yilmaz, Evangelos Kanoulas code 60
Fast Unsupervised Online Drift Detection Using Incremental Kolmogorov-Smirnov Test Denis Moreira dos Reis, Peter A. Flach, Stan Matwin, Gustavo E. A. P. A. Batista code 60
Aircraft Trajectory Prediction Made Easy with Predictive Analytics Samet Ayhan, Hanan Samet code 59
Partial Label Learning via Feature-Aware Disambiguation MinLing Zhang, BinBin Zhou, XuYing Liu code 58
Compressing Graphs and Indexes with Recursive Graph Bisection Laxman Dhulipala, Igor Kabiljo, Brian Karrer, Giuseppe Ottaviano, Sergey Pupyrev, Alon Shalita code 57
Accelerating Online CP Decompositions for Higher Order Tensors Shuo Zhou, Xuan Vinh Nguyen, James Bailey, Yunzhe Jia, Ian Davidson code 56
Compressing Convolutional Neural Networks in the Frequency Domain Wenlin Chen, James T. Wilson, Stephen Tyree, Kilian Q. Weinberger, Yixin Chen code 56
Transfer Knowledge between Cities Ying Wei, Yu Zheng, Qiang Yang code 55
Robust Extreme Multi-label Learning Chang Xu, Dacheng Tao, Chao Xu code 54
Understanding Behaviors that Lead to Purchasing: A Case Study of Pinterest Caroline Lo, Dan Frankowski, Jure Leskovec code 51
Joint Community and Structural Hole Spanner Detection via Harmonic Modularity Lifang He, ChunTa Lu, Jiaqi Ma, Jianping Cao, Linlin Shen, Philip S. Yu code 51
Infinite Ensemble for Image Clustering Hongfu Liu, Ming Shao, Sheng Li, Yun Fu code 50
Label Noise Reduction in Entity Typing by Heterogeneous Partial-Label Embedding Xiang Ren, Wenqi He, Meng Qu, Clare R. Voss, Heng Ji, Jiawei Han code 48
Robust Influence Maximization Xinran He, David Kempe code 47
Repeat Buyer Prediction for E-Commerce Guimei Liu, Tam T. Nguyen, Gang Zhao, Wei Zha, Jianbo Yang, Jianneng Cao, Min Wu, Peilin Zhao, Wei Chen code 45
Catch Me If You Can: Detecting Pickpocket Suspects from Large-Scale Transit Records Bowen Du, Chuanren Liu, Wenjun Zhou, Zhenshan Hou, Hui Xiong code 45
GLMix: Generalized Linear Mixed Models For Large-Scale Response Prediction XianXing Zhang, Yitong Zhou, Yiming Ma, BeeChung Chen, Liang Zhang, Deepak Agarwal code 45
Approximate Personalized PageRank on Dynamic Graphs Hongyang Zhang, Peter Lofgren, Ashish Goel code 43
FASCINATE: Fast Cross-Layer Dependency Inference on Multi-layered Networks Chen Chen, Hanghang Tong, Lei Xie, Lei Ying, Qing He code 43
IoT Big Data Stream Mining Gianmarco De Francisci Morales, Albert Bifet, Latifur Khan, João Gama, Wei Fan code 42
Data-Driven Metric Development for Online Controlled Experiments: Seven Lessons Learned Alex Deng, Xiaolin Shi code 41
Matrix Computations and Optimization in Apache Spark Reza Bosagh Zadeh, Xiangrui Meng, Alexander Ulanov, Burak Yavuz, Li Pu, Shivaram Venkataraman, Evan R. Sparks, Aaron Staple, Matei Zaharia code 41
Towards Confidence in the Truth: A Bootstrapping based Truth Discovery Approach Houping Xiao, Jing Gao, Qi Li, Fenglong Ma, Lu Su, Yunlong Feng, Aidong Zhang code 41
Online Optimization Methods for the Quantification Problem Purushottam Kar, Shuai Li, Harikrishna Narasimhan, Sanjay Chawla, Fabrizio Sebastiani code 39
Learning Cumulatively to Become More Knowledgeable Geli Fei, Shuai Wang, Bing Liu code 38
Recruitment Market Trend Analysis with Sequential Latent Variable Models Chen Zhu, Hengshu Zhu, Hui Xiong, Pengliang Ding, Fang Xie code 37
Talent Circle Detection in Job Transition Networks Huang Xu, Zhiwen Yu, Jingyuan Yang, Hui Xiong, Hengshu Zhu code 36
PTE: Enumerating Trillion Triangles On Distributed Systems HaMyung Park, SungHyon Myaeng, U Kang code 36
Contextual Intent Tracking for Personal Assistants Yu Sun, Nicholas Jing Yuan, Yingzi Wang, Xing Xie, Kieran McDonald, Rui Zhang code 35
Modeling Precursors for Event Forecasting via Nested Multi-Instance Learning Yue Ning, Sathappan Muthiah, Huzefa Rangwala, Naren Ramakrishnan code 35
Diversified Temporal Subgraph Pattern Mining Yi Yang, Da Yan, Huanhuan Wu, James Cheng, Shuigeng Zhou, John C. S. Lui code 35
Overcoming Key Weaknesses of Distance-based Neighbourhood Methods using a Data Dependent Dissimilarity Measure Kai Ming Ting, Ye Zhu, Mark James Carman, Yue Zhu, ZhiHua Zhou code 34
Skinny-dip: Clustering in a Sea of Noise Samuel Maurus, Claudia Plant code 34
Structural Neighborhood Based Classification of Nodes in a Network Sharad Nandanwar, M. Narasimha Murty code 34
City-Scale Map Creation and Updating using GPS Collections Chen Chen, Cewu Lu, Qixing Huang, Qiang Yang, Dimitrios Gunopulos, Leonidas J. Guibas code 34
Portfolio Selections in P2P Lending: A Multi-Objective Perspective Hongke Zhao, Qi Liu, Guifeng Wang, Yong Ge, Enhong Chen code 33
AnyDBC: An Efficient Anytime Density-based Clustering Algorithm for Very Large Complex Datasets Son T. Mai, Ira Assent, Martin Storgaard code 33
Scalable Pattern Matching over Compressed Graphs via Dedensification Antonio Maccioni, Daniel J. Abadi code 32
TRIÈST: Counting Local and Global Triangles in Fully-Dynamic Streams with Fixed Memory Size Lorenzo De Stefani, Alessandro Epasto, Matteo Riondato, Eli Upfal code 32
Just One More: Modeling Binge Watching Behavior William Trouleau, Azin Ashkan, Weicong Ding, Brian Eriksson code 32
Ranking Causal Anomalies via Temporal and Dynamical Analysis on Vanishing Correlations Wei Cheng, Kai Zhang, Haifeng Chen, Guofei Jiang, Zhengzhang Chen, Wei Wang code 31
Beyond Sigmoids: The NetTide Model for Social Network Growth, and Its Applications Chengxi Zang, Peng Cui, Christos Faloutsos code 31
An Empirical Study on Recommendation with Multiple Types of Feedback Liang Tang, Bo Long, BeeChung Chen, Deepak Agarwal code 30
Taxi Driving Behavior Analysis in Latent Vehicle-to-Vehicle Networks: A Social Influence Perspective Tong Xu, Hengshu Zhu, Xiangyu Zhao, Qi Liu, Hao Zhong, Enhong Chen, Hui Xiong code 30
Data-driven Automatic Treatment Regimen Development and Recommendation Leilei Sun, Chuanren Liu, Chonghui Guo, Hui Xiong, Yanming Xie code 29
A Text Clustering Algorithm Using an Online Clustering Scheme for Initialization Jianhua Yin, Jianyong Wang code 29
Probabilistic Robust Route Recovery with Spatio-Temporal Dynamics Hao Wu, Jiangyun Mao, Weiwei Sun, Baihua Zheng, Hanyuan Zhang, Ziyang Chen, Wei Wang code 29
Hierarchical Incomplete Multi-source Feature Learning for Spatiotemporal Event Forecasting Liang Zhao, Jieping Ye, Feng Chen, ChangTien Lu, Naren Ramakrishnan code 29
Bid-aware Gradient Descent for Unbiased Learning with Censored Data in Display Advertising Weinan Zhang, Tianxiong Zhou, Jun Wang, Jian Xu code 28
Improving the Sensitivity of Online Controlled Experiments: Case Studies at Netflix Huizhi Xie, Juliette Aurisset code 27
An Engagement-Based Customer Lifetime Value System for E-commerce Ali Vanderveld, Addhyan Pandey, Angela Han, Rajesh Parekh code 26
Predicting Matchups and Preferences in Context Shuo Chen, Thorsten Joachims code 26
Domain Adaptation in the Absence of Source Domain Data Boris Chidlovskii, Stéphane Clinchant, Gabriela Csurka code 26
Firebird: Predicting Fire Risk and Prioritizing Fire Inspections in Atlanta Michael A. Madaio, ShangTse Chen, Oliver L. Haimson, Wenwen Zhang, Xiang Cheng, Matthew HindsAldrich, Duen Horng Chau, Bistra Dilkina code 26
Developing a Data-Driven Player Ranking in Soccer Using Predictive Model Weights Joel Brooks, Matthew Kerr, John V. Guttag code 25
Kam1n0: MapReduce-based Assembly Clone Search for Reverse Engineering Steven H. H. Ding, Benjamin C. M. Fung, Philippe Charland code 25
A Subsequence Interleaving Model for Sequential Pattern Mining Jaroslav M. Fowkes, Charles Sutton code 25
ABRA: Approximating Betweenness Centrality in Static and Dynamic Graphs with Rademacher Averages Matteo Riondato, Eli Upfal code 25
Identifying Police Officers at Risk of Adverse Events Samuel Carton, Jennifer Helsby, Kenneth Joseph, Ayesha Mahmud, Youngsoo Park, Joe Walsh, Crystal Cody, C. P. T. Estella Patterson, Lauren Haynes, Rayid Ghani code 25
Semi-Markov Switching Vector Autoregressive Model-Based Anomaly Detection in Aviation Systems Igor Melnyk, Arindam Banerjee, Bryan L. Matthews, Nikunj C. Oza code 25
Unbounded Human Learning: Optimal Scheduling for Spaced Repetition Siddharth Reddy, Igor Labutov, Siddhartha Banerjee, Thorsten Joachims code 25
Reconstructing an Epidemic Over Time Polina Rozenshtein, Aristides Gionis, B. Aditya Prakash, Jilles Vreeken code 25
DopeLearning: A Computational Approach to Rap Lyrics Generation Eric Malmi, Pyry Takala, Hannu Toivonen, Tapani Raiko, Aristides Gionis code 24
Streaming-LDA: A Copula-based Approach to Modeling Topic Dependencies in Document Streams Hesam Amoualian, Marianne Clausel, Éric Gaussier, MassihReza Amini code 23
Singapore in Motion: Insights on Public Transport Service Level Through Farecard and Mobile Data Analytics Hasan Poonawala, Vinay Kolar, Sebastien Blandin, Laura Wynter, Sambit Sahu code 23
A Truth Discovery Approach with Theoretical Guarantee Houping Xiao, Jing Gao, Zhaoran Wang, Shiyu Wang, Lu Su, Han Liu code 23
Targeted Topic Modeling for Focused Analysis Shuai Wang, Zhiyuan Chen, Geli Fei, Bing Liu, Sherry Emery code 22
Structured Doubly Stochastic Matrix for Graph Based Clustering: Structured Doubly Stochastic Matrix Xiaoqian Wang, Feiping Nie, Heng Huang code 22
Regime Shifts in Streams: Real-time Forecasting of Co-evolving Time Sequences Yasuko Matsubara, Yasushi Sakurai code 20
MANTRA: A Scalable Approach to Mining Temporally Anomalous Sub-trajectories Prithu Banerjee, Pranali Yawalkar, Sayan Ranu code 20
Finding Gangs in War from Signed Networks Lingyang Chu, Zhefeng Wang, Jian Pei, Jiannan Wang, Zijin Zhao, Enhong Chen code 20
Large-Scale Item Categorization in e-Commerce Using Multiple Recurrent Neural Networks JungWoo Ha, Hyuna Pyo, Jeonghee Kim code 19
Images Don't Lie: Transferring Deep Visual Semantic Features to Large-Scale Multimodal Learning to Rank Corey Lynch, Kamelia Aryafar, Josh Attenberg code 19
Gemello: Creating a Detailed Energy Breakdown from Just the Monthly Electricity Bill Nipun Batra, Amarjeet Singh, Kamin Whitehouse code 19
Multi-Task Feature Interaction Learning Kaixiang Lin, Jianpeng Xu, Inci M. Baytas, Shuiwang Ji, Jiayu Zhou code 19
Boosted Decision Tree Regression Adjustment for Variance Reduction in Online Controlled Experiments Alexey Poyarkov, Alexey Drutsa, Andrey Khalyavin, Gleb Gusev, Pavel Serdyukov code 19
Question Independent Grading using Machine Learning: The Case of Computer Program Grading Gursimran Singh, Shashank Srikant, Varun Aggarwal code 19
When Social Influence Meets Item Inference HuiJu Hung, HongHan Shuai, DeNian Yang, LiangHao Huang, WangChien Lee, Jian Pei, MingSyan Chen code 19
Online Asymmetric Active Learning with Imbalanced Data Xiaoxuan Zhang, Tianbao Yang, Padmini Srinivasan code 19
Days on Market: Measuring Liquidity in Real Estate Markets Hengshu Zhu, Hui Xiong, Fangshuang Tang, Qi Liu, Yong Ge, Enhong Chen, Yanjie Fu code 19
Mining Subgroups with Exceptional Transition Behavior Florian Lemmerich, Martin Becker, Philipp Singer, Denis Helic, Andreas Hotho, Markus Strohmaier code 18
Parallel Dual Coordinate Descent Method for Large-scale Linear Classification in Multi-core Environments WeiLin Chiang, MuChu Lee, ChihJen Lin code 18
Scalable Betweenness Centrality Maximization via Sampling Ahmad Mahmoody, Charalampos E. Tsourakakis, Eli Upfal code 18
Keeping it Short and Simple: Summarising Complex Event Sequences with Multivariate Patterns Roel Bertens, Jilles Vreeken, Arno Siebes code 18
Evaluating Mobile Apps with A/B and Quasi A/B Tests Ya Xu, Nanyu Chen code 17
Email Volume Optimization at LinkedIn Rupesh Gupta, Guanfeng Liang, HsiaoPing Tseng, Ravi Kiran Holur Vijay, Xiaoyu Chen, Rómer Rosales code 17
EMBERS at 4 years: Experiences operating an Open Source Indicators Forecasting System Sathappan Muthiah, Patrick Butler, Rupinder Paul Khandpur, Parang Saraf, Nathan Self, Alla Rozovskaya, Liang Zhao, Jose Cadena, ChangTien Lu, Anil Vullikanti, Achla Marathe, Kristen Maria Summers, Graham Katz, Andy Doyle, Jaime Arredondo, Dipak K. Gupta, David Mares, Naren Ramakrishnan code 17
How to Get Them a Dream Job?: Entity-Aware Features for Personalized Job Search Ranking Jia Li, Dhruv Arya, Viet HaThuc, Shakti Sinha code 16
FUSE: Full Spectral Clustering Wei Ye, Sebastian Goebl, Claudia Plant, Christian Böhm code 16
Positive-Unlabeled Learning in Streaming Networks Shiyu Chang, Yang Zhang, Jiliang Tang, Dawei Yin, Yi Chang, Mark A. HasegawaJohnson, Thomas S. Huang code 16
Goal-Directed Inductive Matrix Completion Si Si, KaiYang Chiang, ChoJui Hsieh, Nikhil Rao, Inderjit S. Dhillon code 16
From Truth Discovery to Trustworthy Opinion Discovery: An Uncertainty-Aware Quantitative Modeling Approach Mengting Wan, Xiangyu Chen, Lance M. Kaplan, Jiawei Han, Jing Gao, Bo Zhao code 16
Revisiting Random Binning Features: Fast Convergence and Strong Parallelizability Lingfei Wu, Ian EnHsu Yen, Jie Chen, Rui Yan code 15
Deploying Analytics with the Portable Format for Analytics (PFA) Jim Pivarski, Collin Bennett, Robert L. Grossman code 15
Efficient Processing of Network Proximity Queries via Chebyshev Acceleration Mustafa Coskun, Ananth Grama, Mehmet Koyutürk code 15
CaSMoS: A Framework for Learning Candidate Selection Models over Structured Queries and Documents Fedor Borisyuk, Krishnaram Kenthapadi, David Stein, Bo Zhao code 14
CompanyDepot: Employer Name Normalization in the Online Recruitment Industry Qiaoling Liu, Faizan Javed, Matt McNair code 14
Towards Optimal Cardinality Estimation of Unions and Intersections with Sketches Daniel Ting code 14
The Legislative Influence Detector: Finding Text Reuse in State Legislation Matthew Burgess, Eugenia Giraudy, Julian KatzSamuels, Joe Walsh, Derek Willis, Lauren Haynes, Rayid Ghani code 14
Smart Broadcasting: Do You Want to be Seen? Mohammad Reza Karimi, Erfan Tavakoli, Mehrdad Farajtabar, Le Song, Manuel GomezRodriguez code 14
Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining Kazuya Nakagawa, Shinya Suzumura, Masayuki Karasuyama, Koji Tsuda, Ichiro Takeuchi code 14
From Online Behaviors to Offline Retailing Ping Luo, Su Yan, Zhiqiang Liu, Zhiyong Shen, Shengwen Yang, Qing He code 13
Dynamic and Robust Wildfire Risk Prediction System: An Unsupervised Approach Mahsa Salehi, Laura Irina Rusu, Timothy M. Lynar, Anna Phan code 13
Come-and-Go Patterns of Group Evolution: A Dynamic Model Tianyang Zhang, Peng Cui, Christos Faloutsos, Yunfei Lu, Hao Ye, Wenwu Zhu, Shiqiang Yang code 13
Robust Large-Scale Machine Learning in the Cloud Steffen Rendle, Dennis Fetterly, Eugene J. Shekita, BorYiing Su code 12
Audience Expansion for Online Social Network Advertising Haishan Liu, David Pardoe, Kun Liu, Manoj Thakur, Frank Cao, Chongzhe Li code 12
CatchTartan: Representing and Summarizing Dynamic Multicontextual Behaviors Meng Jiang, Christos Faloutsos, Jiawei Han code 12
NetCycle: Collective Evolution Inference in Heterogeneous Information Networks Yizhou Zhang, Yun Xiong, Xiangnan Kong, Yangyong Zhu code 12
Predicting Socio-Economic Indicators using News Events Sunandan Chakraborty, Ashwin Venkataraman, Srikanth Jagabathula, Lakshminarayanan Subramanian code 12
Scalable Partial Least Squares Regression on Grammar-Compressed Data Matrices Yasuo Tabei, Hiroto Saigo, Yoshihiro Yamanishi, Simon J. Puglisi code 12
The Million Domain Challenge: Broadcast Email Prioritization by Cross-domain Recommendation Beidou Wang, Martin Ester, Yikang Liao, Jiajun Bu, Yu Zhu, Ziyu Guan, Deng Cai code 11
Ranking Universities Based on Career Outcomes of Graduates Navneet Kapur, Nikita I. Lytkin, BeeChung Chen, Deepak Agarwal, Igor Perisic code 11
QUINT: On Query-Specific Optimal Networks Liangyue Li, Yuan Yao, Jie Tang, Wei Fan, Hanghang Tong code 11
A Multiple Test Correction for Streams and Cascades of Statistical Hypothesis Tests Geoffrey I. Webb, François Petitjean code 11
Computational Social Science: Exciting Progress and Future Challenges Duncan Watts code 11
Efficient Shift-Invariant Dictionary Learning Guoqing Zheng, Yiming Yang, Jaime G. Carbonell code 11
Convex Optimization for Linear Query Processing under Approximate Differential Privacy Ganzhao Yuan, Yin Yang, Zhenjie Zhang, Zhifeng Hao code 10
Analyzing Volleyball Match Data from the 2014 World Championships Using Machine Learning Techniques Jan Van Haaren, Horesh Ben Shitrit, Jesse Davis, Pascal Fua code 10
Scalable Fast Rank-1 Dictionary Learning for fMRI Big Data Analysis Xiang Li, Milad Makkie, Binbin Lin, Mojtaba Sedigh Fazli, Ian Davidson, Jieping Ye, Tianming Liu, Shannon Quinn code 10
The Limits of Popularity-Based Recommendations, and the Role of Social Ties Marco Bressan, Stefano Leucci, Alessandro Panconesi, Prabhakar Raghavan, Erisa Terolli code 10
Accelerated Stochastic Block Coordinate Descent with Optimal Sampling Aston Zhang, Quanquan Gu code 10
Efficient Frequent Directions Algorithm for Sparse Matrices Mina Ghashami, Edo Liberty, Jeff M. Phillips code 10
Mining Reliable Information from Passively and Actively Crowdsourced Data Jing Gao, Qi Li, Bo Zhao, Wei Fan, Jiawei Han code 10
Detecting Devastating Diseases in Search Logs John Paparrizos, Ryen W. White, Eric Horvitz code 9
Engagement Capacity and Engaging Team Formation for Reach Maximization of Online Social Media Platforms Alexander G. Nikolaev, Shounak Gore, Venu Govindaraju code 9
Joint Optimization of Multiple Performance Metrics in Online Video Advertising Sahin Cem Geyik, Sergey Faleev, Jianqiang Shen, Sean O'Donnell, Santanu Kolay code 9
MAP: Frequency-Based Maximization of Airline Profits based on an Ensemble Forecasting Approach Bo An, Haipeng Chen, Noseong Park, V. S. Subrahmanian code 9
EMBERS AutoGSR: Automated Coding of Civil Unrest Events Parang Saraf, Naren Ramakrishnan code 9
FLASH: Fast Bayesian Optimization for Data Analytic Pipelines Yuyu Zhang, Mohammad Taha Bahadori, Hang Su, Jimeng Sun code 9
How to Compete Online for News Audience: Modeling Words that Attract Clicks Joon Hee Kim, Amin Mantrach, Alejandro Jaimes, Alice Oh code 8
Distributing the Stochastic Gradient Sampler for Large-Scale LDA Yuan Yang, Jianfei Chen, Jun Zhu code 7
Text Mining in Clinical Domain: Dealing with Noise Hoang Nguyen, Jon Patrick code 7
Subjectively Interesting Component Analysis: Data Projections that Contrast with Prior Expectations Bo Kang, Jefrey Lijffijt, Raúl SantosRodriguez, Tijl De Bie code 7
Accelerating the Race to Autonomous Cars Danny Shapiro code 7
Computational Drug Repositioning Using Continuous Self-Controlled Case Series Zhaobin Kuang, James A. Thomson, Michael Caldwell, Peggy L. Peissig, Ron M. Stewart, David Page code 7
Assessing Human Error Against a Benchmark of Perfection Ashton Anderson, Jon M. Kleinberg, Sendhil Mullainathan code 7
Inferring Network Effects from Observational Data David T. Arbour, Dan Garant, David D. Jensen code 7
Robust and Effective Metric Learning Using Capped Trace Norm: Metric Learning via Capped Trace Norm Zhouyuan Huo, Feiping Nie, Heng Huang code 7
When Recommendation Goes Wrong: Anomalous Link Discovery in Recommendation Networks Bryan Perozzi, Michael Schueppert, Jack Saalweachter, Mayur Thakur code 6
Collaborative Multi-View Denoising Lei Zhang, Shupeng Wang, Xiaoyu Zhang, Yong Wang, Binbin Li, Dinggang Shen, Shuiwang Ji code 6
Online Feature Selection: A Limited-Memory Substitution Algorithm and Its Asynchronous Parallel Variation Haichuan Yang, Ryohei Fujimaki, Yukitaka Kusumura, Ji Liu code 6
Lightweight Monitoring of Distributed Streams Arnon Lazerson, Daniel Keren, Assaf Schuster code 6
Bayesian Inference of Arrival Rate and Substitution Behavior from Sales Transaction Data with Stockouts Benjamin Letham, Lydia M. Letham, Cynthia Rudin code 6
Temporal Order-based First-Take-All Hashing for Fast Attention-Deficit-Hyperactive-Disorder Detection Hao Hu, Joey VelezGinorio, GuoJun Qi code 6
Burstiness Scale: A Parsimonious Model for Characterizing Random Series of Events Rodrigo Augusto da Silva Alves, Renato Martins Assunção, Pedro Olmo Stancioli Vaz de Melo code 6
Squish: Near-Optimal Compression for Archival of Relational Datasets Yihan Gao, Aditya G. Parameswaran code 6
A Real Linear and Parallel Multiple Longest Common Subsequences (MLCS) Algorithm Yanni Li, Hui Li, Tihua Duan, Sheng Wang, Zhi Wang, Yang Cheng code 6
Lossless Separation of Web Pages into Layout Code and Data Adi Omari, Benny Kimelfeld, Eran Yahav, Sharon Shoham code 6
Dynamics of Large Multi-View Social Networks: Synergy, Cannibalization and Cross-View Interplay Yu Shi, Myunghwan Kim, Shaunak Chatterjee, Mitul Tiwari, Souvik Ghosh, Rómer Rosales code 6
Compute Job Memory Recommender System Using Machine Learning Taraneh Taghavi, Maria Lupetini, Yaron Kretchmer code 5
Minimizing Legal Exposure of High-Tech Companies through Collaborative Filtering Methods Bo Jin, Chao Che, Kuifei Yu, Yue Qu, Li Guo, Cuili Yao, Ruiyun Yu, Qiang Zhang code 5
Lexis: An Optimization Framework for Discovering the Hierarchical Structure of Sequential Data Payam Siyari, Bistra Dilkina, Constantine Dovrolis code 5
Predictors without Borders: Behavioral Modeling of Product Adoption in Three Developing Countries Muhammad Raza Khan, Joshua E. Blumenstock code 5
Privacy-preserving Class Ratio Estimation Arun Shankar Iyer, J. Saketha Nath, Sunita Sarawagi code 5
Sampling of Attributed Networks from Hierarchical Generative Models Pablo RoblesGranda, Sebastián Moreno, Jennifer Neville code 5
Identifying Decision Makers from Professional Social Networks Shipeng Yu, Evangelia Christakopoulou, Abhishek Gupta code 5
A Non-parametric Approach to Detect Epileptogenic Lesions using Restricted Boltzmann Machines Yijun Zhao, Bilal Ahmed, Thomas Thesen, Karen E. Blackmon, Jennifer G. Dy, Carla E. Brodley, Ruben Kuzniecky, Orrin Devinsky code 5
Communication Efficient Distributed Kernel Principal Component Analysis MariaFlorina Balcan, Yingyu Liang, Le Song, David P. Woodruff, Bo Xie code 5
From Prediction to Action: A Closed-Loop Approach for Data-Guided Network Resource Allocation Yanan Bao, Huasen Wu, Xin Liu code 5
Lifelong Machine Learning and Computer Reading the Web Zhiyuan Chen, Estevam R. Hruschka Jr., Bing Liu code 4
Continuous Experience-aware Language Model Subhabrata Mukherjee, Stephan Günnemann, Gerhard Weikum code 4
Graph Wavelets via Sparse Cuts Arlei Silva, XuanHong Dang, Prithwish Basu, Ambuj K. Singh, Ananthram Swami code 4
Towards Robust and Versatile Causal Discovery for Business Applications Giorgos Borboudakis, Ioannis Tsamardinos code 4
Optimal Reserve Prices in Upstream Auctions: Empirical Application on Online Video Advertising Miguel Angel Alcobendas Lisbona, Sheide Chammas, Kuangchih Lee code 3
Generalized Hierarchical Sparse Model for Arbitrary-Order Interactive Antigenic Sites Identification in Flu Virus Data Lei Han, Yu Zhang, XiuFeng Wan, Tong Zhang code 3
Predict Risk of Relapse for Patients with Multiple Stages of Treatment of Depression Zhi Nie, Pinghua Gong, Jieping Ye code 3
Absolute Fused Lasso and Its Application to Genome-Wide Association Studies Tao Yang, Jun Liu, Pinghua Gong, Ruiwen Zhang, Xiaotong Shen, Jieping Ye code 3
Designing Policy Recommendations to Reduce Home Abandonment in Mexico Klaus Ackermann, Eduardo Blancas Reyes, Sue He, Thomas Anderson Keller, Paul van der Boor, Romana Khan, Rayid Ghani, José Carlos González code 2
Online Dual Decomposition for Performance and Delivery-Based Distributed Ad Allocation Jim C. Huang, Rodolphe Jenatton, Cédric Archambeau code 2
The Wisdom of Crowds: Best Practices for Data Prep & Machine Learning Derived from Millions of Data Science Workflows Ingo Mierswa code 2
People, Computers, and The Hot Mess of Real Data Joseph M. Hellerstein code 2
Learning Sparse Models at Scale Ralf Herbrich code 2
Compact and Scalable Graph Neighborhood Sketching Takuya Akiba, Yosuke Yano code 2
Annealed Sparsity via Adaptive and Dynamic Shrinking Kai Zhang, Shandian Zhe, Chaoran Cheng, Zhi Wei, Zhengzhang Chen, Haifeng Chen, Guofei Jiang, Yuan Qi, Jieping Ye code 2
Causal Clustering for 1-Factor Measurement Models Erich Kummerfeld, Joseph D. Ramsey code 2
Parallel Lasso Screening for Big Data Optimization Qingyang Li, Shuang Qiu, Shuiwang Ji, Paul M. Thompson, Jieping Ye, Jie Wang code 2
Leveraging Propagation for Data Mining: Models, Algorithms and Applications B. Aditya Prakash, Naren Ramakrishnan code 2
Healthcare Data Mining with Matrix Models Fei Wang, Ping Zhang, Joel Dudley code 2
Graphons and Machine Learning: Modeling and Estimation of Sparse Massive Networks Jennifer T. Chayes code 1
Fast Component Pursuit for Large-Scale Inverse Covariance Estimation Lei Han, Yu Zhang, Tong Zhang code 1
Learning to Learn and Compositionality with Deep Recurrent Neural Networks: Learning to Learn and Compositionality Nando de Freitas code 1
The Evolving Meaning of Information Security Whitfield Diffie code 1
Identifying Earmarks in Congressional Bills Ellery Wulczyn, Madian Khabsa, Vrushank Vora, Matthew Heston, Joe Walsh, Christopher Berry, Rayid Ghani code 1
Batch Model for Batched Timestamps Data Analysis with Application to the SSA Disability Program Qingqi Yue, Ao Yuan, Xuan Che, Minh Huynh, Chunxiao Zhou code 1
Bayesian Optimization and Embedded Learning Systems Jeff Schneider code 1
Collective Sensemaking via Social Sensors: Extracting, Profiling, Analyzing, and Predicting Real-world Events Yuheng Hu, YuRu Lin, Jiebo Luo code 1
Scalable Learning of Graphical Models François Petitjean, Geoffrey I. Webb code 1
Business Applications of Predictive Modeling at Scale Qiang Zhu, Songtao Guo, Paul Ogilvie, Yan Liu code 1
Profiling Users from Online Social Behaviors with Applications for Tencent Social Ads Ching Law code 0
Large-Scale Machine Learning at Verizon: Theory and Applications Ashok Srivastava code 0
How Machine Learning has Finally Solved Wanamaker's Dilemma Oliver Downs code 0
Streaming Analytics Ashish Gupta, Neera Agarwal code 0
A VC View of Investing in ML Greg Papadopoulos code 0
Big Data Needs Big Dreamers: Lessons from Successful Big Data Investors Evangelos Simoudis, Mark Gorenberg, Tim Guleri, Matt Ocko, Greg Sands code 0
Can You Teach the Elephant to Dance? AKA: Culture Eats Data Science for Breakfast Jonathan D. Becher code 0
Scalable Time-Decaying Adaptive Prediction Algorithm Yinyan Tan, Zhe Fan, Guilin Li, Fangshan Wang, Zhengbing Li, Shikai Liu, Qiuling Pan, Eric P. Xing, Qirong Ho code 0
Optimally Discriminative Choice Sets in Discrete Choice Models: Application to Data-Driven Test Design Igor Labutov, Frans Schalekamp, Kelvin Luu, Hod Lipson, Christoph Studer code 0
Improving Survey Aggregation with Sparsely Represented Signals Tianlin Shi, Forest Agostinelli, Matthew Staib, David P. Wipf, Thomas Moscibroda code 0
Scalable Data Analytics Using R: Single Machines to Hadoop Spark Clusters JohnMark Agosta, Debraj GuhaThakurta, Robert Horton, Mario Inchiosa, Srini Kumar, Mengyue Zhao code 0