Table of Contents

Preface from General Chairs
Xueqi Cheng (Chinese Academy of Sciences)

Hang Li (Huawei Technologies)

Preface by the Program Chairs
Evgeniy Gabrilovich (Google)

Jie Tang (Tsinghua University)

WSDM 2015 Organization

WSDM 2015 Program Committee & Additional Reviewers

WSDM 2015 Sponsors & Supporters

Keynote Address 1
Session 1: Practice & Experience Talk + Panel on Large Scale Data Understanding
Session 2: Web Search
Session 3: Social Networks
Keynote Address 2
Session 4: Web Mining
Session 5: Practice & Experience Talks
Session 6: Crowdsourcing, Temporal and Location-based Mining
Keynote Address 3
Session 7: User Modeling, Mobility, and Recommendations
Session 8: Practice & Experience Talks
Session 9: Web Mining (2)
Tutorials
Workshop Summaries
Doctoral Consortium

Keynote Address 1
Session Chair: Xueqi Cheng (Chinese Academy of Sciences)

Making Sense of Big Data with the Berkeley Data Analytics Stack (Page 1)
Michael Franklin (University of California, Berkeley)

Session 1: Practice & Experience Talk + Panel on Large Scale Data Understanding
Session Chairs: Ying Li [P&E Talk] (EV Analysis Corporation) and Andrei Broder [Panel] (Google)

New Directions in Recommender Systems (Page 3)
Jure Leskovec (Stanford University)

Big Data: New Paradigm or "Sound and Fury, Signifying Nothing"? (Page 5)
Andrei Broder (Google)

Lada Adamic (Facebook)

Michael Franklin (University of California, Berkeley)

Maarten de Rijke (University of Amsterdam)

Eric Xing (Carnegie Mellon University)

Kai Yu (Baidu)

Session 2: Web Search
Session Chair: Maarten de Rijke (University of Amsterdam)

Delayed-Dynamic-Selective (DDS) Prediction for Reducing Extreme Tail Latency in Web Search (Page 7)
Saehoon Kim (POSTECH)

Yuxiong He (Microsoft Research)

Seung-won Hwang (POSTECH)

Sameh Elnikety (Microsoft Research)

Seungjin Choi (POSTECH)

MergeRUCB: A Method for Large-Scale Online Ranker Evaluation (Page 17)
Masrour Zoghi (University of Amsterdam)

Shimon Whiteson (University of Amsterdam)

Maarten de Rijke (University of Amsterdam)

Engagement Periodicity in Search Engine Usage: Analysis and its Application to Search Quality Evaluation (Page 27)
Alexey Drutsa (Yandex)

Gleb Gusev (Yandex)

Pavel Serdyukov (Yandex)

Toward Predicting the Outcome of an A/B Experiment for Search Relevance (Page 37)
Lihong Li (Microsoft Corp)

Jin Young Kim (Microsoft Corp)

Imed Zitouni (Microsoft Corp)

Optimal Space-time Tradeoffs for Inverted Indexes (Page 47)
Giuseppe Ottaviano (National Research Council of Italy)

Nicola Tonellotto (National Research Council of Italy)

Rossano Venturini (National Research Council of Italy & University of Pisa)

Understanding and Predicting Graded Search Satisfaction (Page 57)
Jiepu Jiang (University of Massachusetts Amherst)

Ahmed Hassan Awadallah (Microsoft Research Redmond)

Xiaolin Shi (Microsoft Research Redmond)

Ryen W. White (Microsoft Research Redmond)

Robust Tree-based Causal Inference for Complex Ad Effectiveness Analysis (Page 67)
Pengyuan Wang (Yahoo Labs)

Wei Sun (Purdue University)

Dawei Yin (Yahoo Labs)

Jian Yang (Yahoo Labs)

Yi Chang (Yahoo Labs)

Session 3: Social Networks
Session Chair: Elad Yom-Tov (Microsoft Research)

The Power of Random Neighbors in Social Networks (Page 77)
Silvio Lattanzi (Google, Inc.)

Yaron Singer (Harvard University)

Negative Link Prediction in Social Media (Page 87)
Jiliang Tang (Arizona State University)

Shiyu Chang (University of Illinois at Urbana-Champaign)

Charu Aggarwal (IBM T.J. Watson Research Center)

Huan Liu (Arizona State University)

Sarcasm Detection on Twitter: A Behavioral Modeling Approach (Page 97)
Ashwin Rajadesingan (Arizona State University)

Reza Zafarani (Arizona State University)

Huan Liu (Arizona State University)

Modeling and Predicting Retweeting Dynamics on Microblogging Platforms (Page 107)
Shuai Gao (Shandong University)

Jun Ma (Shandong University)

Zhumin Chen (Shandong University)

On Integrating Network and Community Discovery (Page 117)
Jialu Liu (University of Illinois at Urbana-Champaign)

Charu Aggarwal (IBM T.J. Watson Research Center)

Jiawei Han (University of Illinois at Urbana-Champaign)

On the Accuracy of Hyper-local Geotagging of Social Media Content (Page 127)
David Flatow (Cornell Tech & Stanford University)

Mor Naaman (Cornell Tech)

Ke Eddie Xie (Cornell Tech & Twitter Inc.)

Yana Volkovich (Cornell Tech & Barcelona Media)

Yaron Kanza (Cornell Tech & Technion - Israel Institute of Technology)

Keynote Address 2
Session Chair: Evgeniy Gabrilovich (Google)

The Information Life of Social Networks (Page 273)
Lada A. Adamic (Facebook)

Session 4: Web Mining
Session Chair: Huan Liu (Arizona State University)

Learning to Recommend Related Entities to Search Users (Page 139)
Bin Bi (University of California, Los Angeles)

Hao Ma (Microsoft Research)

Bo-June (Paul) Hsu (Microsoft Research)

Wei Chu (Microsoft)

Kuansan Wang (Microsoft Research)

Junghoo Cho (University of California, Los Angeles)

Will This Paper Increase Your h-Index? Scientific Impact Prediction (Page 149)
Yuxiao Dong (University of Notre Dame)

Reid A. Johnson (University of Notre Dame)

Nitesh V. Chawla (University of Notre Dame)

Concept Graph Learning from Educational Data (Page 159)
Yiming Yang (Carnegie Mellon University)

Hanxiao Liu (Carnegie Mellon University)

Jaime Carbonell (Carnegie Mellon University)

Wanli Ma (Carnegie Mellon University)

Review Synthesis for Micro-Review Summarization (Page 169)
Thanh-Son Nguyen (Singapore Management University)

Hady W. Lauw (Singapore Management University)

Panayiotis Tsaparas (University of Ioannina)

Fast and Space-Efficient Entity Linking in Queries (Page 179)
Roi Blanco (Yahoo Labs)

Giuseppe Ottaviano (ISTI-CNR)

Edgar Meij (Yahoo Labs)

On Tag Recommendation for Expertise Profiling: A Case Study in the Scientific Domain (Page 189)
Isac S. Ribeiro (Universidade Federal de Minas Gerais)

Rodrygo L. T. Santos (Universidade Federal de Minas Gerais)

Marcos A. Gonçalves (Universidade Federal de Minas Gerais)

Alberto H. F. Laender (Universidade Federal de Minas Gerais)

FLAME: A Probabilistic Model Combining Aspect Based Opinion Mining and Collaborative Filtering (Page 199)
Yao Wu (Simon Fraser University)

Martin Ester (Simon Fraser University)

Session 5: Practice & Experience Talks
Session Chair: Paul Bennett (Microsoft)

Semantic Matching in APP Search (Page 209)
Juchao Zhuo (Tencent Inc.)

Zeqian Huang (Tencent Inc.)

Yunfeng Liu (Tencent Inc.)

Zhanhui Kang (Tencent Inc.)

Xun Cao (Tencent Inc.)

Mingzhi Li (Tencent Inc.)

Long Jin (Tencent Inc.)

Boosting Search with Deep Understanding of Contents and Users (Page 211)
Kaihua Zhu (Baidu)

Session 6: Crowdsourcing, Temporal and Location-based Mining
Session Chair: Charlie Clarke (University of Waterloo)

Driven by Food: Modeling Geographic Choice (Page 213)
Ravi Kumar (Google)

Mohammad Mahdian (Google)

Bo Pang (Google)

Andrew Tomkins (Google)

Sergei Vassilvitskii (Google)

Hiring Behavior Models for Online Labor Markets (Page 223)
Marios Kokkodis (NYU Stern)

Panagiotis Papadimitriou (Elance-oDesk)

Panagiotis G. Ipeirotis (NYU Stern)

Just in Time Recommendations - Modeling the Dynamics of Boredom in Activity Streams (Page 233)
Komal Kapoor (University of Minnesota)

Karthik Subbian (University of Minnesota)

Jaideep Srivastava (University of Minnesota)

Paul Schrater (University of Minnesota)

Leveraging In-Batch Annotation Bias for Crowdsourced Active Learning (Page 243)
Honglei Zhuang (LinkedIn Corporation & University of Illinois at Urbana-Champaign)

Joel Young (LinkedIn Corporation)

Listwise Approach for Rank Aggregation in Crowdsourcing (Page 253)
Shuzi Niu (Chinese Academy of Sciences)

Yanyan Lan (Chinese Academy of Sciences)

Jiafeng Guo (Chinese Academy of Sciences)

Xueqi Cheng (Chinese Academy of Sciences)

Lei Yu (Chinese Academy of Sciences)

Guoping Long (Chinese Academy of Sciences)

WorkerRank: Using Employer Implicit Judgements to Infer Worker Reputation (Page 263)
Maria Daltayanni (University of California, Santa Cruz)

Luca de Alfaro (University of California, Santa Cruz)

Panagiotis Papadimitriou (Elance-oDesk)

Keynote Address 3
Session Chair: Jie Tang (Tsinghua University)

Learning from User Interactions (Page 137)
Thorsten Joachims (Cornell University)

Session 7: User Modeling, Mobility, and Recommendations
Session Chair: Grace Hui Yang (Georgetown University)

User Modeling for a Personal Assistant (Page 275)
Ramanathan Guha (Google)

Vineet Gupta (Google)

Vivek Raghunathan (Google)

Ramakrishnan Srikant (Google)

Predicting the Next App that You Are Going to Use (Page 285)
Ricardo Baeza-Yates (Yahoo Labs)

Di Jiang (HKUST)

Fabrizio Silvestri (Yahoo Labs)

Beverly Harrison (Yahoo Labs)

You Are Where You Go: Inferring Demographic Attributes from Location Check-Ins (Page 295)
Yuan Zhong (Microsoft Research & Northeastern University)

Nicholas Jing Yuan (Microsoft Research)

Wen Zhong (Stony Brook University)

Fuzheng Zhang (University of Science and Technology of China & Microsoft Research)

Xing Xie (Microsoft Research)

SimApp: A Framework for Detecting Similar Mobile Applications by Online Kernel Learning (Page 305)
Ning Chen (Nanyang Technological University)

Steven C. H. Hoi (Singapore Management University)

Shaohua Li (Nanyang Technological University)

Xiaokui Xiao (Nanyang Technological University)

Personalized Mobile App Recommendation: Reconciling App Functionality and User Privacy Preference (Page 315)
Bin Liu (Rutgers University)

Deguang Kong (Samsung Research America)

Lei Cen (Purdue University)

Neil Zhenqiang Gong (University of California, Berkeley)

Hongxia Jin (Samsung Research America)

Hui Xiong (Rutgers University)

Inferring Movement Trajectories from GPS Snippets (Page 325)
Mu Li (Carnegie Mellon University)

Amr Ahmed (Google Strategic Technologies)

Alexander J. Smola (Carnegie Mellon University & Google Strategic Technologies)

Session 8: Practice & Experience Talks
Session Chair: Xuanjing Huang (Fudan University)

Regressing Towards Simpler Prediction Systems (Page 335)
Tushar Chandra (Google, Inc.)

Global Optimization for Display Ad (Page 337)
Rong Ji (Alibaba)

Session 9: Web Mining (2)
Session Chair: Fabrizio Silvestri (Yahoo!)

Back to the Past: Supporting Interpretations of Forgotten Stories by Time-aware Re-Contextualization (Page 339)
Nam Khanh Tran (Leibniz Universität)

Andrea Ceroni (Leibniz Universität)

Nattiya Kanhabua (Leibniz Universität)

Claudia Niederée (Leibniz Universität)

Diluted Treatment Effect Estimation for Trigger Analysis in Online Controlled Experiments (Page 349)
Alex Deng (Microsoft)

Victor Hu (Microsoft)

Inverting a Steady-State (Page 359)
Ravi Kumar (Google)

Andrew Tomkins (Google)

Sergei Vassilvitskii (Google)

Erik Vee (Google)

Automatic Gloss Finding for a Knowledge Base Using Ontological Constraints (Page 369)
Bhavana Dalvi (Carnegie Mellon University)

Einat Minkov (University of Haifa)

Partha P. Talukdar (Indian Institute of Science)

William W. Cohen (Carnegie Mellon University)

Finding Subgraphs with Maximum Total Density and Limited Overlap (Page 379)
Oana Denisa Balalau (Telecom Paristech)

Francesco Bonchi (Yahoo Labs)

T-H. Hubert Chan (The University of Hong Kong)

Francesco Gullo (Yahoo Labs)

Mauro Sozio (Telecom Paristech)

Modeling Website Popularity Competition in the Attention-Activity Marketplace (Page 389)
Bruno Ribeiro (Carnegie Mellon University)

Christos Faloutsos (Carnegie Mellon University)

Exploring the Space of Topic Coherence Measures (Page 399)
Michael Röder (Leipzig University)

Andreas Both (Unister GmbH)

Alexander Hinneburg (Martin-Luther-University)

Tutorials

Dynamic Information Retrieval Modeling (Page 409)
Hui Yang (Georgetown University)

Marc Sloan (University College London)

Jun Wang (University College London)

Scalability and Efficiency Challenges in Large-Scale Web Search Engines (Page 411)
B. Barla Cambazoglu (Yahoo Labs)

Ricardo Baeza-Yates (Yahoo Labs)

Offline Evaluation and Optimization for Interactive Systems (Page 413)
Lihong Li (Microsoft Research)

Real-Time Bidding: A New Frontier of Computational Advertising Research (Page 415)
Jun Wang (University College London)

Shuai Yuan (University College London)

Learning About Health and Medicine from Internet Data (Page 417)
Elad Yom-Tov (Microsoft Research)

Ingemar Johansson Cox (University College London)

Vasileios Lampos (University College London)

Distributed Graph Algorithmics: Theory and Practice (Page 419)
Silvio Lattanzi (Google Research)

Vahab Mirrokni (Google Research)

Workshop Summaries

DL-WSDM'15: Workshop on Deep Learning for Web Search and Data Mining (Page 421)
Bin Gao (Microsoft Research)

Jiang Bian (Microsoft Research)

HIA'15: Heterogeneous Information Access Workshop at WSDM 2015 (Page 423)
Ke Zhou (Yahoo Labs)

Roger Jie Luo (Yahoo Labs)

Djoerd Hiemstra (University of Twente)

Joemon M. Jose (University of Glasgow)

WSDM'15 Workshop Summary / Scalable Data Analytics: Theory and Applications (Page 425)
Kaizhu Huang (Xi'an Jiaotong-Liverpool University)

Haiqin Yang (Chinese University of Hong Kong)

Irwin King (Chinese University of Hong Kong)

Michael R. Lyu (Chinese University of Hong Kong)

The 2nd Workshop on Vertical Search Relevance at WSDM 2015 (Page 427)
Dawei Yin (Yahoo Labs)

Chih-Chieh Hung (Rakuten Inc.)

Rui Li (Yahoo Labs)

Yi Chang (Yahoo Labs)

Doctoral Consortium

An Approach to the Problem of Annotation of Research Publications (Page 429)
Ekaterina Chernyak (National Research University)

Incorporating Phrase-Level Sentiment Analysis on Textual Reviews for Personalized Recommendation (Page 435)
Yongfeng Zhang (Tsinghua University)

Mining Groups Stability in Ubiquitous and Social Environments: Communities, Classes and Clusters (Page 441)
Mark Kibanov (University of Kassel)

Sentiment-Specific Representation Learning for Document-Level Sentiment Analysis (Page 447)
Duyu Tang (Harbin Institute of Technology)

Chronological Scientific Information Recommendation via Supervised Dynamic Topic Modeling (Page 453)
Zhuoren Jiang (Dalian Maritime University)

Topics, Tasks & Beyond: Learning Representations for Personalization (Page 459)
Rishabh Mehrotra (University College London)