Jun Yang

D327 Levine Science Research Center
Box 90129
Duke University
Durham, North Carolina 27708-0129
Tel: 919-660-6587
Fax: 919-660-6519
Web: http://www.cs.duke.edu/~junyang/
Email: <cs.duke.edu, junyang>

Research Interests

Education

Professional Experience

Publications

Published work:
  1. You Wu, Pankaj K. Agarwal, Chengkai Li, Jun Yang, and Cong Yu. "Toward computational fact-checking." Proceedings of the VLDB Endowment, 7(7):589-600, 2014. [paper]
  2. Naeemul Hassan, Afroza Sultana, You Wu, Gensheng Zhang, Chengkai Li, Jun Yang, and Cong Yu. "Data in, fact out: automated monitoring of facts by FactWatcher." Proceedings of the VLDB Endowment, 7(13), 2014. Demonstration track. [paper]
  3. You Wu, Brett Walenz, Peggy Li, Andrew Shim, Emre Sonmez, Pankaj K. Agarwal, Chengkai Li, Jun Yang, and Cong Yu. "iCheck: computationally combating “lies, d—ned lies, and statistics”." In Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, Snowbird, Utah, USA, June 2014. Demonstration track. [paper]
  4. Albert Yu, Pankaj K. Agarwal, and Jun Yang. "Top-k preferences in high dimensions." In Proceedings of the 2014 International Conference on Data Engineering, Chicago, Illinois, USA, March 2014.
  5. Afroza Sultana, Naeemul Hassan, Chengkai Li, Jun Yang, and Cong Yu. "Incremental discovery of prominent situational facts." In Proceedings of the 2014 International Conference on Data Engineering, Chicago, Illinois, USA, March 2014.
  6. Risi Thonangi and Jun Yang. "Permuting data on random-access block storage." Proceedings of the VLDB Endowment, 6(9):721-732, 2013. [errata, paper, and report]
  7. Botong Huang, Shivnath Babu, and Jun Yang. "Cumulon: optimizing statistical data analysis in the cloud." In Proceedings of the 2013 ACM SIGMOD International Conference on Management of Data, New York City, New York, USA, June 2013. [paper and slides]
  8. Yi Zhang, Kristian Lum, and Jun Yang. "Failure-aware cascaded suppression in wireless sensor networks." IEEE Transactions on Knowledge and Data Engineering, 25(5):1042-1055, May 2013. [paper and supplemental]
  9. Pankaj K. Agarwal, Lars Arge, Sathish Govindarajan, Jun Yang, and Ke Yi. "Efficient external memory structures for range-aggregate queries." Computational Geometry: Theory and Applications, 46(3):358-370, April 2013. [paper]
  10. Albert Yu, Pankaj K. Agarwal, and Jun Yang. "Subscriber assignment for wide-area content-based publish/subscribe." IEEE Transactions on Knowledge and Data Engineering, 24(10):1833-1847, 2012. Invited as a special selection from ICDE 2011. [paper and supplemental]
  11. S. N. Lahiri, XuanLong Nguyen, Jun Yang, Zhengyuan Zhu, and P. Banerjee. "Wireless sensor networks: statistical issues and challenges." Journal of the Indian Statistical Association, 50(1–2):151-191, 2012.
  12. Rada Chirkova and Jun Yang. "Materialized views." Foundations and Trends in Databases, 4(4):295-405, 2012. [paper]
  13. Risi Thonangi, Shivnath Babu, and Jun Yang. "A practical concurrent index for solid-state drives." In Proceedings of the 2012 International Conference on Information and Knowledge Management, pages 1332-1341, Maui, Hawaii, USA, October 2012. Databases track. [paper and report]
  14. You Wu, Pankaj K. Agarwal, Chengkai Li, Jun Yang, and Cong Yu. "On “one of the few” objects." In Proceedings of the 2012 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1487-1495, Beijing, China, August 2012. [paper and report]
  15. Yi Zhang and Jun Yang. "Optimizing I/O for big array analytics." Proceedings of the VLDB Endowment, 5(8):764-775, June 2012. [paper]
  16. Albert Yu, Pankaj K. Agarwal, and Jun Yang. "Processing a large number of continuous preference top-k queries." In Proceedings of the 2012 ACM SIGMOD International Conference on Management of Data, pages 397-408, Scottsdale, Arizona, USA, May 2012. [paper]
  17. Albert Yu, Pankaj K. Agarwal, and Jun Yang. "Processing and notifying range top-k subscriptions." In Proceedings of the 2012 International Conference on Data Engineering, pages 810-821, Washington DC, USA, April 2012. [paper and report]
  18. Yi Zhang, Kamesh Munagala, and Jun Yang. "Storing matrices on disk: theory and practice revisited." Proceedings of the VLDB Endowment, 4(11):1075-1086, August 2011. [paper and report]
  19. James S. Clark, Pankaj K. Agarwal, David M. Bell, Paul G. Flikkema, Alan Gelfand, Xuanlong Nguyen, Eric Ward, and Jun Yang. "Inferential ecosystem models, from network data to prediction." Ecological Applications, 21(5):1523-1536, July 2011.
  20. Albert Yu, Pankaj K. Agarwal, and Jun Yang. "Subscriber assignment for wide-area content-based publish/subscribe." In Proceedings of the 2011 International Conference on Data Engineering, pages 267-278, Hannover, Germany, April 2011. Results in this paper are subsumed by those in the TKDE 2012 paper by the same authors. [paper and report]
  21. Sarah Cohen, Chengkai Li, Jun Yang, and Cong Yu. "Computational journalism: a call to arms to database researchers." In Proceedings of the 2011 Conference on Innovative Data Systems Research, Asilomar, California, USA, January 2011. Outrageous ideas and vision track. Third-place winner of the Best Outrageous Ideas and Vision Track Paper Competition sponsored by the Computing Community Consortium. [paper and slides]
  22. Lei Chen, Changjie Tang, Jun Yang, and Yunjun Gao, ed. Proceedings of the 2010 International Conference on Web-Age Information Management, Jiuzhaigou, Sichuan, China, July 2010. Lecture Notes in Computer Science 6184. Springer.
  23. Yi Zhang, Weiping Zhang, and Jun Yang. "I/O-efficient statistical computing with RIOT." In Proceedings of the 2010 International Conference on Data Engineering, pages 1157-1160, Long Beach, California, USA, March 2010. Demonstration track. [paper and poster]
  24. Jun Yang, Kamesh Munagala, and Adam Silberstein. "Data aggregation in sensor networks." In Encyclopedia of Database Systems. Ling Liu and M. Tamer Özsu, ed. Springer. 2009. Invited contribution.
  25. Albert Yu, Pankaj K. Agarwal, and Jun Yang. "Generating wide-area content-based publish/subscribe workloads." In Proceedings of the 2009 Workshop on Networking Meets Databases, Big Sky, Montana, USA, October 2009. [paper]
  26. Pankaj K. Agarwal, Junyi Xie, Jun Yang, and Hai Yu. "Input-sensitive scalable continuous join query processing." ACM Transactions on Database Systems, 34(3):1-41, August 2009. [paper]
  27. Fei Chen, Byron J. Gao, AnHai Doan, Jun Yang, and Raghu Ramakrishnan. "Optimizing complex extraction programs over evolving text data." In Proceedings of the 2009 ACM SIGMOD International Conference on Management of Data, pages 321-334, Providence, Rhode Island, USA, June 2009. [paper]
  28. Risi Thonangi, Hao He, AnHai Doan, Haixun Wang, and Jun Yang. "Weighted proximity best-joins for information retrieval." In Proceedings of the 2009 International Conference on Data Engineering, pages 234-245, Shanghai, China, March 2009. [paper]
  29. Yi Zhang, Herodotos Herodotou, and Jun Yang. "RIOT: I/O-efficient numerical computing without SQL." In Proceedings of the 2009 Conference on Innovative Data Systems Research, Asilomar, California, USA, January 2009. [paper and slides]
  30. Badrish Chandramouli and Jun Yang. "End-to-end support for joins in large-scale publish/subscribe systems." In Proceedings of the 2008 International Conference on Very Large Data Bases, pages 434-450, Auckland, New Zealand, August 2008. Infrastructure track. [paper]
  31. Badrish Chandramouli, Jun Yang, Pankaj K. Agarwal, Albert Yu, and Ying Zheng. "ProSem: scalable wide-area publish/subscribe." In Proceedings of the 2008 ACM SIGMOD International Conference on Management of Data, pages 1315-1318, Vancouver, Canada, June 2008. Demonstration track. Acceptance rate: 31.9 percent. [paper]
  32. Junyi Xie, Jun Yang, Yuguo Chen, Haixun Wang, and Philip S. Yu. "A sampling-based approach to information recovery." In Proceedings of the 2008 International Conference on Data Engineering, pages 476-485, Cancun, Mexico, April 2008. Short presentation track. Acceptance rate: 19.2 percent of 715. Full paper. [paper]
  33. Fei Chen, AnHai Doan, Jun Yang, and Raghu Ramakrishnan. "Efficient information extraction over evolving text data." In Proceedings of the 2008 International Conference on Data Engineering, pages 943-952, Cancun, Mexico, April 2008. Acceptance rate: 12.1 percent of 715. [paper]
  34. Magdalena Balazinska, Amol Deshpande, Alexandros Labrinidis, Qiong Luo, Samuel Madden, and Jun Yang. "Report on the fourth international workshop on data management for sensor networks (DMSN 2007)." ACM SIGMOD Record, 36(4):53-55, 2007.
  35. Adam Silberstein, Gavino Puggioni, Alan E. Gelfand, Kamesh Munagala, and Jun Yang. "Suppression and failures in sensor data: a Bayesian approach." In Proceedings of the 2007 International Conference on Very Large Data Bases, pages 842-853, Vienna, Austria, September 2007. Infrastructure track. Acceptance rate: 45 out of 275. [paper]
  36. Badrish Chandramouli, Jeff M. Phillips, and Jun Yang. "Value-based notification conditions in large-scale publish/subscribe systems." In Proceedings of the 2007 International Conference on Very Large Data Bases, pages 878-889, Vienna, Austria, September 2007. Infrastructure track. Acceptance rate: 45 out of 275. [paper]
  37. Magdalena Balazinska, Amol Deshpande, Qiong Luo, and Jun Yang, ed. Proceedings of the 2007 International Workshop on Data Management for Sensor Networks, Vienna, Austria, September 2007.
  38. Hao He, Haixun Wang, Jun Yang, and Philip S. Yu. "BLINKS: ranked keyword searches on graphs." In Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pages 305-316, Beijing, China, June 2007. Acceptance rate: 70 out of 480. [paper and report]
  39. Badrish Chandramouli, Christopher N. Bond, Shivnath Babu, and Jun Yang. "Query suspend and resume." In Proceedings of the 2007 ACM SIGMOD International Conference on Management of Data, pages 557-568, Beijing, China, June 2007. Acceptance rate: 70 out of 480. [paper and report]
  40. Adam Silberstein and Jun Yang. "Many-to-many aggregation for sensor networks." In Proceedings of the 2007 International Conference on Data Engineering, pages 986-995, Istanbul, Turkey, April 2007. Acceptance rate: 122 out of 659. [paper and report]
  41. Badrish Chandramouli, Christopher Bond, Shivnath Babu, and Jun Yang. "On suspending and resuming dataflows." In Proceedings of the 2007 International Conference on Data Engineering, pages 1289-1291, Istanbul, Turkey, April 2007. Poster track. Acceptance rate: 60(+122) out of 659. Results in this paper are subsumed by those in the SIGMOD 2007 paper by the same authors.
  42. Adam Silberstein, Gregory Filpus, Kamesh Munagala, and Jun Yang. "Data-driven processing in sensor networks." In Proceedings of the 2007 Conference on Innovative Data Systems Research, pages 10-21, Asilomar, California, USA, January 2007. Acceptance rate: 34 out of 98. [paper]
  43. Junyi Xie and Jun Yang. "A survey of join processing in data streams." In Data Streams: Models and Algorithms. Charu C. Aggarwal, ed. Springer. November 2006. Invited contribution. [paper]
  44. Pankaj K. Agarwal, Junyi Xie, Jun Yang, and Hai Yu. "Scalable continuous query processing by tracking hotspots." In Proceedings of the 2006 International Conference on Very Large Data Bases, pages 31-42, Seoul, Korea, September 2006. Core database track. Acceptance rate: 46 out of 334. Results in this paper are subsumed by those in the 2009 TODS paper by the same authors. [paper and report]
  45. Adam Silberstein, Kamesh Munagala, and Jun Yang. "Energy-efficient monitoring of extreme values in sensor networks." In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pages 169-180, Chicago, Illinois, USA, June 2006. Acceptance rate: 58 out of 446. [paper]
  46. Adam Silberstein, Rebecca Braynard, and Jun Yang. "Constraint chaining: on energy-efficient continuous monitoring in sensor networks." In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pages 157-168, Chicago, Illinois, USA, June 2006. Acceptance rate: 58 out of 446. [paper]
  47. Badrish Chandramouli, Junyi Xie, and Jun Yang. "On the database/network interface in large-scale publish/subscribe systems." In Proceedings of the 2006 ACM SIGMOD International Conference on Management of Data, pages 587-598, Chicago, Illinois, USA, June 2006. Acceptance rate: 58 out of 446. [paper and report]
  48. Paul G. Flikkema, Pankaj K. Agarwal, James S. Clark, Carla Schlatter Ellis, Alan Gelfand, Kamesh Munagala, and Jun Yang. "Model-driven dynamic control of embedded wireless sensor networks." In Proceedings of the 2006 International Conference on Computational Science, pages 409-416, Reading, United Kingdom, May 2006.
  49. Haixun Wang, Hao He, Jun Yang, Philip S. Yu, and Jeffrey Xu Yu. "Dual labeling: answering graph reachability queries in constant time." In Proceedings of the 2006 International Conference on Data Engineering, Atlanta, Georgia, USA, April 2006. Acceptance rate: 89 out of 456. [paper]
  50. Adam Silberstein, Rebecca Braynard, and Jun Yang. "Energy-efficient continuous isoline queries in sensor networks." In Proceedings of the 2006 International Conference on Data Engineering, Atlanta, Georgia, USA, April 2006. Poster track. Results in this paper are subsumed by those in the SIGMOD 2006 paper by the same authors. [paper]
  51. Adam Silberstein, Rebecca Braynard, Carla Ellis, Kamesh Munagala, and Jun Yang. "A sampling-based approach to optimizing top-k queries in sensor networks." In Proceedings of the 2006 International Conference on Data Engineering, Atlanta, Georgia, USA, April 2006. Acceptance rate: 89 out of 456. [paper]
  52. Badrish Chandramouli, Jun Yang, and Amin Vahdat. "Distributed network querying with bounded approximate caching." In Proceedings of the 2006 International Conference on Database Systems for Advanced Applications, pages 374-388, Singapore, April 2006. Acceptance rate: 24.5 percent. [paper and report]
  53. Pankaj K. Agarwal, Junyi Xie, Jun Yang, and Hai Yu. "Monitoring continuous band-join queries over dynamic data." In Proceedings of the 2005 International Symposium on Algorithms and Computation, pages 349-359, Sanya, Hainan, China, December 2005. [paper]
  54. Hao He, Haixun Wang, Jun Yang, and Philip S. Yu. "Compact reachability labeling for graph-structured data." In Proceedings of the 2005 International Conference on Information and Knowledge Management, pages 594-601, Bremen, Germany, November 2005. Acceptance rate: 76 out of 425. [paper and report]
  55. Kamesh Munagala, Jun Yang, and Hai Yu. "Online view maintenance under a response-time constraint." In Proceedings of the 2005 European Symposium on Algorithms, pages 677-688, Palma de Mallorca, Spain, October 2005. [paper]
  56. Wenfei Fan, Zhaohui Wu, and Jun Yang, ed. Proceedings of the 2005 International Conference on Web-Age Information Management, Hangzhou, China, October 2005. Lecture Notes in Computer Science 3739. Springer.
  57. Junyi Xie, Jun Yang, and Yuguo Chen. "On joining and caching stochastic streams." In Proceedings of the 2005 ACM SIGMOD International Conference on Management of Data, pages 359-370, Baltimore, Maryland, USA, June 2005. Acceptance rate: 65 out of 431. [paper and report]
  58. Adam Silberstein, Hao He, Ke Yi, and Jun Yang. "BOXes: efficient maintenance of order-based labeling for dynamic XML data." In Proceedings of the 2005 International Conference on Data Engineering, pages 285-296, Tokyo, Japan, April 2005. Acceptance rate: 67 out of 521. [paper and report]
  59. Hao He, Junyi Xie, Jun Yang, and Hai Yu. "Asymmetric batch incremental view maintenance." In Proceedings of the 2005 International Conference on Data Engineering, pages 106-117, Tokyo, Japan, April 2005. Acceptance rate: 67 out of 521. [paper]
  60. Junfei Geng and Jun Yang. "AutoBib: automatic extraction of bibliographic information on the Web." In Proceedings of the 2004 International Database Engineering and Applications Symposium, pages 193-204, Coimbra, Portugal, July 2004. [paper]
  61. Ke Yi, Hao He, Ioana Stanoi, and Jun Yang. "Incremental maintenance of XML structural indexes." In Proceedings of the 2004 ACM SIGMOD International Conference on Management of Data, pages 491-502, Paris, France, June 2004. Acceptance rate: 69 out of 431. [paper]
  62. Adam Silberstein and Jun Yang. "NeXSort: sorting XML in external memory." In Proceedings of the 2004 International Conference on Data Engineering, pages 695-706, Boston, Massachusetts, USA, April 2004. Acceptance rate: 63 out of 441. [paper and report]
  63. Hao He and Jun Yang. "Multiresolution indexing of XML for frequent queries." In Proceedings of the 2004 International Conference on Data Engineering, pages 683-694, Boston, Massachusetts, USA, April 2004. Acceptance rate: 63 out of 441. [paper and report]
  64. Jun Yang and Jennifer Widom. "Incremental computation and maintenance of temporal aggregates." The VLDB Journal, 12(3):262-283, 2003. [paper]
  65. Zhiyuan Chen, Li Chen, Jian Pei, Yufei Tao, Haixun Wang, Wei Wang, Jiong Yang, Jun Yang, and Donghui Zhang. "Recent progress on selected topics in database research: a report by nine young Chinese researchers working in the United States." Journal of Computer Science and Technology, 18(5):538-552, September 2003.
  66. Pankaj K. Agarwal, Lars Arge, Jun Yang, and Ke Yi. "I/O-efficient structures for orthogonal range-max and stabbing-max queries." In Proceedings of the 2003 European Symposium on Algorithms, pages 7-18, Budapest, Hungary, September 2003.
  67. Xiao Huang, Qiang Xue, and Jun Yang. "TupleRank and implicit relationship discovery in relational databases." In Proceedings of the 2003 International Conference on Web-Age Information Management, pages 445-457, Chengdu, China, August 2003. Acceptance rate: 30 out of 258. [paper and report]
  68. Ke Yi, Hai Yu, Jun Yang, Gangqiang Xia, and Yuguo Chen. "Efficient maintenance of materialized top-k views." In Proceedings of the 2003 International Conference on Data Engineering, pages 189-200, Bangalore, India, March 2003. Acceptance rate: 51 out of 378. [paper and report]
  69. Jun Yang. "Temporal data warehousing." Ph.D. Dissertation, Stanford University, August 2001.
  70. Jun Yang and Jennifer Widom. "Incremental computation and maintenance of temporal aggregates." In Proceedings of the 2001 International Conference on Data Engineering, pages 51-60, Heidelberg, Germany, April 2001. Acceptance rate: 14 percent. Results in this paper are subsumed by those in the 2003 VLDB Journal paper by the same authors.
  71. Wilburt Juan Labio, Jun Yang, Yingwei Cui, Hector Garcia-Molina, and Jennifer Widom. "Performance issues in incremental warehouse maintenance." In Proceedings of the 2000 International Conference on Very Large Data Bases, pages 461-472, Cairo, Egypt, September 2000. Acceptance rate: 53 out of 351.
  72. Jun Yang, Huacheng C. Ying, and Jennifer Widom. "TIP: a temporal extension to Informix." In Proceedings of the 2000 ACM SIGMOD International Conference on Management of Data, page 596, Dallas, Texas, USA, May 2000. Demonstration track.
  73. Jun Yang, Huacheng C. Ying, and Jennifer Widom. "TIP: a temporal extension to Informix." In Proceedings of the 2000 International Conference on Extending Database Technology, Konstanz, Germany, March 2000. Demonstration track. An improved version was shown in SIGMOD 2000.
  74. Jun Yang and Jennifer Widom. "Temporal view self-maintenance." In Proceedings of the 2000 International Conference on Extending Database Technology, pages 395-412, Konstanz, Germany, March 2000. Acceptance rate: 16.7 percent.
  75. Hector Garcia-Molina, Wilburt Juan Labio, and Jun Yang. "Expiring data in a warehouse." In Proceedings of the 1998 International Conference on Very Large Data Bases, pages 500-511, New York City, New York, USA, August 1998. Acceptance rate: 16 percent.
  76. Jun Yang and Jennifer Widom. "Maintaining temporal views over non-temporal information sources for data warehousing." In Proceedings of the 1998 International Conference on Extending Database Technology, pages 389-403, Valencia, Spain, March 1998. Acceptance rate: 32 out of 191.
  77. Laura M. Haas, Donald Kossmann, Edward L. Wimmers, and Jun Yang. "Optimizing queries across diverse data sources." In Proceedings of the 1997 International Conference on Very Large Data Bases, pages 276-285, Athens, Greece, August 1997. Acceptance rate: 16 percent.
  78. Laura M. Haas, Donald Kossmann, Edward L. Wimmers, and Jun Yang. "An optimizer for heterogeneous systems with non-standard data and search capabilities." IEEE Data Engineering Bulletin, 19(4):37-44, December 1996.
  79. Steve G. Steinberg, Jun Yang, and Katherine A. Yelick. "Performance modeling and composition: a case study in cell simulation." In Proceedings of the 1996 International Parallel Processing Symposium, pages 68-74, Honolulu, Hawaii, USA, April 1996. Acceptance rate: 35 percent.
Technical reports:

Funding

Current funding:
Pending proposals:
Past funding:

Honors and Awards

External Presentations and Demonstrations

  1. "From Answering Questions to Questioning Answers (and Questions): Toward Computational Fact-Checking," presentation at MIT, Big Data Initiative, May 2014.
  2. "Big Data: Not Just about the Size," presentation at the Forum of Future Data, Wuyishan, China, July 2012.
  3. "Problems in Computational Journalism," presentation at HP Labs, Beijing, China, June 2012.
  4. "Fun with Arrays and Matrices in RIOT," informal talk at Stanford InfoLab lunch, August 2011.
  5. "Computational Journalism: A Call to Arms to Database Researchers," presentation at the 2011 Conference on Innovative Data Systems Research (CIDR 2011), January 2011.
  6. "Scalable Continuous Query Processing and Result Dissemination," seminar at HP Labs, Beijing, China, August 2010.
  7. "Data-Driven Processing in Sensor Networks," seminar at Stanford University, January 2009.
  8. "A Sampling-Based Approach to Information Recovery," presentation at the 2008 Annual Meeting of the Institute for Operations Research and the Management Sciences (INFORMS 2008), October 2008.
  9. "Thoughts on Data Sharing: A Database Researcher's Perspective," presentation at the Primate Life History Working Group Meeting, NESCent (National Evolutionary Synthesis Center), August 2007.
  10. "Query Suspend and Resume," presentation at the 2007 ACM SIGMOD International Conference on Management of Data (SIGMOD 2007), June 2007.
  11. "Data-Driven Processing in Sensor Networks," seminars at University of Pennsylvania, University of Waterloo, and New England Database Society, April 2007 - October 2007.
  12. "Scalable Continuous Query Processing and Result Dissemination," seminars at IBM T. J. Watson Research Center, University of Maryland at College Park, University of Pittsburgh/Carnegie Mellon University Joint Database Seminar, Brown University, University of Illinois at Urbana-Champaign, and University of California at Berkeley, February 2006 - December 2006.
  13. "Continuous Query Processing over Networked Data," presentation at IBM Research Triangle Park University Day, October 2006.
  14. Panel discussion at SIGMOD '06 Life after Graduation Symposium, June 2006.
  15. "Scalable Continuous Query Processing and Result Dissemination," talk at the 2006 Southeast Workshop on Data and Information Management (SEWDIM 2006), March 2006.
  16. "Querying Networked Data," presentation at IBM Research Triangle Park University Day, October 2005.
  17. "An Overview of Database Research at Duke," presentation at inDuke Meeting, Duke University, May 2005.
  18. "Caching for Network Querying," presentation at SIGMOD '05 Program Committee Workshop, Stanford, California, February 2005.
  19. "Layers and Boxes: Efficient and Maintainable Indexes for XML," seminar at IBM T. J. Watson Research Center, July 2004.
  20. "AutoBib: Automatic Extraction of Bibliographic Information on the Web," presentation at the 2004 International Database Engineering and Applications Symposium (IDEAS 2004).
  21. "Post-Web-Age Information Management," panel discussion at the 2003 International Conference on Web-Age Information Management (WAIM 2003).
  22. "TupleRank and Implicit Relationship Discovery in Databases," presentation at the 2003 International Conference on Web-Age Information Management (WAIM 2003).
  23. "Problems in Database View Maintenance and Web Data Extraction," seminar at University of North Carolina at Greensboro, April 2003.
  24. "Efficient Maintenance of Materialized Top-k Views," presentation at the 2003 International Conference on Data Engineering (ICDE 2003).
  25. "Incremental Computation and Maintenance of Temporal Aggregates," presentation at the 2001 International Conference on Data Engineering (ICDE 2001).
  26. "Query Processing in Kidar," guest lecture for a course on database system implementation at Stanford University, Stanford, California, November 2000.
  27. "Performance Issues in Incremental Warehouse Maintenance," presentation at the 2000 International Conference on Very Large Data Bases (VLDB 2000).
  28. "TIP: A Temporal Extension to Informix," system demonstration at the 2000 ACM SIGMOD International Conference on Management of Data (SIGMOD 2000).
  29. "Temporal Data Warehousing," colloquia at Brown University, Cornell University, Duke University, Harvard University, Santa Clara University, State University of New York at Stony Brook, University of California at Santa Barbara, University of California at Santa Cruz, University of Southern California, Yale University, and IBM Almaden Research Center, February 2000 - May 2000.
  30. "TIP: A Temporal Extension to Informix," presentation and system demonstration at Stanford Database Workshop, Stanford, California, March 2000.
  31. "TIP: A Temporal Extension to Informix," presentation and system demonstration at Informix Corporation, Oakland, California, March 2000.
  32. "Temporal View Self-Maintenance," presentation at the 2000 International Conference on Extending Database Technology (EDBT 2000).
  33. "TIP: A Temporal Extension to Informix," system demonstration at the 2000 International Conference on Extending Database Technology (EDBT 2000).
  34. "Maintaining Temporal Views Over Non-Temporal Information Sources For Data Warehousing," presentation at the 1998 International Conference on Extending Database Technology (EDBT 1998).
  35. "Performance Modeling and Composition: A Case Study in Cell Simulation," presentation at the 1996 International Parallel Processing Symposium (IPPS 1996).

Teaching

Student Advising

Current Ph.D. students:
Graduated Ph.D. students:
Graduated M.S. students:
Undergraduate theses supervised:
Undergraduate research internship:
Undergraduate independent studies:
Ph.D. defense committee (not as primary advisor):
Ph.D. preliminary exam committee (not as primary advisor):
Ph.D. research initiation project committee (not as primary advisor):
M.S. committee (not as primary advisor):
Undergraduate thesis committee (not as primary advisor):

Activities

Service to the professional community:
  1. Subject Area Editor (Database and Knowledge-Based Systems), Journal of Computer Science and Technology (JCST), December 2011 - present.
  2. Program Committee Group Leader, the 2015 ACM SIGMOD International Conference on Management of Data (SIGMOD 2015).
  3. General Co-Chair, the 2015 International Conference on Web-Age Information Management (WAIM 2015).
  4. Program Committee, the 2014 International Conference on Information and Knowledge Management (CIKM 2014).
  5. Review Board, Proceedings of the VLDB Endowment, August 2008 - March 2012 and April 2013 - present.
  6. Program Committee Co-Chair, the 2014 International Workshop on Bringing the Value of Big Data to Users (DATA4U 2014).
  7. Best Paper Selection Committee, the 2014 National Database Conference of China (NDBC 2014).
  8. Program Committee, the 2014 ACM SIGMOD International Conference on Management of Data (SIGMOD 2014).
  9. Program Committee, the 2014 International Workshop on Exploratory Search in Databases and the Web (EXPLOREDB 2014).
  10. Senior Program Committee, the 2013 International Conference on Information and Knowledge Management (CIKM 2013).
  11. Demonstration Program Committee Co-Chair, the 2013 International Conference on Very Large Data Bases (VLDB 2013).
  12. Best Paper Selection Committee, the 2013 National Database Conference of China (NDBC 2013).
  13. Program Committee Area Chair (Streams, Sensor Networks, Complex Event Processing), the 2013 ACM SIGMOD International Conference on Management of Data (SIGMOD 2013).
  14. Associate Editor, IEEE Transactions on Knowledge and Data Engineering (TKDE), March 2009 - March 2013.
  15. Publicity Co-Chair, the 2013 International Conference on Database Systems for Advanced Applications (DASFAA 2013).
  16. Panel Co-Chair, the 2013 International Conference on Data Engineering (ICDE 2013).
  17. Senior Program Committee, the 2012 International Conference on Information and Knowledge Management (CIKM 2012).
  18. Best Paper Selection Committee, the 2012 National Database Conference of China (NDBC 2012).
  19. Program Committee, the 2012 ACM SIGMOD International Conference on Management of Data (SIGMOD 2012).
  20. Program Committee, the 2012 International Conference on Data Engineering (ICDE 2012).
  21. Program Committee, the 2011 International Conference on Data Engineering (ICDE 2011).
  22. Program Committee, the 2011 Conference on Innovative Data Systems Research (CIDR 2011).
  23. Program Committee, the 2010 International Conference on Very Large Data Bases (VLDB 2010).
  24. Program Committee, the 2010 International Workshop on Data Management for Sensor Networks (DMSN 2010).
  25. Program Committee Co-Chair, the 2010 International Conference on Web-Age Information Management (WAIM 2010).
  26. Program Committee, the 2010 International Conference on Data Engineering (ICDE 2010).
  27. Program Committee, the 2010 International Workshop on Ranking in Databases (DBRANK 2010).
  28. Program Committee, the 2009 International Workshop on Cloud Data Management (CLOUDDB 2009).
  29. Program Committee, the 2009 IFIP/ACM International Conference on Distributed Systems Platforms (MIDDLEWARE 2009).
  30. Program Committee, the 2009 ACM SIGMOD International Conference on Management of Data (SIGMOD 2009).
  31. Program Committee, the 2009 ACM Workshop on Data Engineering for Wireless and Mobile Access (MOBIDE 2009).
  32. Program Committee, the 2009 International Workshop on Scalable Stream Processing Systems (SSPS 2009).
  33. Regional Chair (America), the 2009 International Conference on Database Systems for Advanced Applications (DASFAA 2009).
  34. Program Committee, the 2009 International Conference on World Wide Web (WWW 2009).
  35. Program Committee, the 2009 International Workshop on Ranking in Databases (DBRANK 2009).
  36. Program Committee, the 2009 International Conference on Data Engineering (ICDE 2009).
  37. Program Committee, the 2009 Conference on Innovative Data Systems Research (CIDR 2009).
  38. Steering Committee Member, International Conference on Web-Age Information Management (WAIM), September 2008 - present.
  39. General Co-Chair and Program Committee Member, the 2008 International Workshop on Data Management for Sensor Networks (DMSN 2008).
  40. Program Committee, the 2008 International Conference on Information and Knowledge Management (CIKM 2008).
  41. Program Committee, the 2008 ACM Workshop on Data Engineering for Wireless and Mobile Access (MOBIDE 2008).
  42. Program Committee, the 2008 International Conference on Web-Age Information Management (WAIM 2008).
  43. Program Committee, the 2008 IEEE International Conference on Computational Science and Engineering (CSE 2008).
  44. Program Committee, the 2008 International Workshop on Scalable Stream Processing Systems (SSPS 2008).
  45. Program Committee, the 2008 International Conference on Very Large Data Bases (VLDB 2008).
  46. Program Committee, the 2008 ACM SIGMOD International Conference on Management of Data (SIGMOD 2008).
  47. Program Committee, the 2008 International Conference on Data Engineering (ICDE 2008).
  48. Program Committee Co-Chair, the 2007 International Workshop on Data Management for Sensor Networks (DMSN 2007).
  49. Demonstration Program Committee, the 2007 International Conference on Very Large Data Bases (VLDB 2007).
  50. Program Committee, the 2007 International Conference on Scalable Information Systems (INFOSCALE 2007).
  51. Program Committee, the 2007 International Symposium on Large Spatio-Temporal Databases (SSTD 2007).
  52. Program Committee, the 2007 Joint Conference of the Asia-Pacific Web Conference and the International Conference on Web-Age Information Management (APWEBWAIM 2007).
  53. Program Committee, the 2007 ACM SIGMOD International Conference on Management of Data (SIGMOD 2007).
  54. Program Committee, the 2007 ACM SIGMOD International Conference on Management of Data (SIGMOD 2007) Ph.D. Workshop on Innovative Database Research.
  55. Program Committee, the 2007 Workshop on Networking Meets Databases (NETDB 2007).
  56. Program Committee, the 2007 International Workshop on Scalable Stream Processing Systems (SSPS 2007).
  57. Program Committee, the 2007 International Conference on Data Engineering (ICDE 2007).
  58. Program Committee, the 2006 International Conference on Information and Knowledge Management (CIKM 2006).
  59. Program Committee, the 2006 International Conference on Geosensor Networks (GSN 2006).
  60. Program Committee, the 2006 International Workshop on Data Management for Sensor Networks (DMSN 2006).
  61. Program Committee, the 2006 International XML Database Symposium (XSYM 2006).
  62. Program Committee, the 2006 International Conference on Very Large Data Bases (VLDB 2006) Ph.D. Workshop.
  63. Program Committee Co-Chair, the 2006 Southeast Workshop on Data and Information Management (SEWDIM 2006).
  64. Program Committee, the 2006 International Conference on Web-Age Information Management (WAIM 2006).
  65. Program Committee, the 2005 International Conference on Data Mining (ICDM 2005).
  66. Program Committee, the 2005 ACM International Workshop on Web Information and Data Management (WIDM 2005).
  67. Program Committee, the 2005 ACM SIGMOD International Conference on Management of Data (SIGMOD 2005).
  68. Program Committee, the 2005 International XML Database Symposium (XSYM 2005).
  69. Program Committee, the 2005 International Conference on Very Large Data Bases (VLDB 2005) Ph.D. Workshop.
  70. Program Committee, the 2005 International Conference on Database Systems for Advanced Applications (DASFAA 2005).
  71. Publications Chair, the 2005 International Conference on Web-Age Information Management (WAIM 2005).
  72. Program Committee, the 2004 International Conference on Data Mining (ICDM 2004).
  73. Program Committee, the 2004 International Conference on Very Large Data Bases (VLDB 2004).
  74. Program Committee, the 2004 ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (SIGKDD 2004).
  75. Demonstration Program Committee, the 2004 ACM SIGMOD International Conference on Management of Data (SIGMOD 2004).
  76. Participant in the Summer Workshop on Developing the Field of Computational Journalism, Center for Advanced Study in Behavioral Sciences, Stanford, California, July 2009.
  77. Panelist for NSF, IIS Division, 2003, 2004, 2005, 2008, 2009, 2010, 2011.
  78. Panelist for Department of Homeland Security, 2006.
  79. Expert Panelist on Cancer Reporting Information Technology, Office of the Assistant Secretary for Planning and Evaluation, Department of Health and Human Services, 2008 - 2009.
  80. Reviewer for Research Grants Council of Hong Kong, 2010, 2012.
  81. Reviewer for Natural Sciences and Engineering Research Council of Canada, 2008.
  82. Reviewer for Netherlands Organisation for Scientific Research, 2006.
  83. Associate Information Director, ACM SIGMOD, 2003 - present.
  84. Started the Carolina Database Research Group (CDB) in 2003 with a group of database researchers in North Carolina; continue to serve as one of the main organizers.
  85. Publicity Chair, the 2004 International Conference on Mobile Data Management (MDM 2004).
  86. Reviewer for journals: ACM Transactions on Database Systems (TODS), The VLDB Journal (VLDBJ), IEEE Transactions on Knowledge and Data Engineering (TKDE), ACM Transactions on Programming Languages and Systems (TOPLAS), ACM SIGMOD Record (SIGMODREC), The Computer Journal (CJ), Information and Computation (IC), Information Processing Letters (IPL), IEEE Transactions on Mobile Computing (TMC), Data and Knowledge Engineering (DKE), IEEE Internet Computing (INTERNET), Information and Software Technology (IST), Journal of Systems and Software (JSS), Knowledge and Information Systems (KAIS), Ad Hoc and Sensor Wireless Networks (AHSWN), Journal of Research and Practice in Information Technology (JRPIT), Journal of Computer Science and Technology (JCST), Distributed and Parallel Databases (DPDB), International Journal of Computer Systems Science and Engineering (CSSE), LNCS Journal on Data Semantics (JODS), Electronics and Telecommunications Research Institute Journal (ETRI), Proceedings of the IEEE (PIEEE).
  87. Reviewer for conferences: ACM SIGMOD International Conference on Management of Data (SIGMOD), International Conference on Very Large Data Bases (VLDB), International Conference on Data Engineering (ICDE), ACM Symposium on Principles of Database Systems (PODS), International Conference on World Wide Web (WWW), International Conference on Information and Knowledge Management (CIKM), International Workshop on the Web and Databases (WEBDB), ACM Symposium on Cloud Computing (SOCC), International Symposium on Theoretical Aspects of Computer Science (STACS), European Symposium on Algorithms (ESA), International Conference on Distributed Computing Systems (ICDCS), International Conference on Mobile Systems, Applications, and Services (MOBISYS), USENIX Annual Technical Conference (USENIX), ACM Symposium on Parallel Algorithms and Architectures (SPAA).
  88. Designer of the ACM SIGMOD logo, IEEE Data Engineering logo, Stanford InfoLab's old logo, VLDB 2011 logo, and a number of others.
Service to Duke University and the Department of Computer Science:

Other activities: