About Me

I am a PhD candidate under the supervision of Professor Juliana Freire at NYU School of Engineering in Computer Science. In the past summers, I worked with Data Science groups at Los Alamos National Laboratory and IBM Watson Research Center as a research intern. Before joining NYU, I was a research engineer at VNG and a founding engineer at Wala. I got my bachelor degree in Computer Science from Hanoi University of Science and Technology.


My areas of interests include information retrieval and machine learning. Since 2014, I has been working on the DARPA Memex project, in which my research focuses on crawling and machine learning techniques for discovering domain-specific content from the Web.


  • Learning to Discover Domain-Specific Web Content
    Kien Pham, Aecio Santos, Juliana Freire.
    The 11th ACM International Conference on Web Search and Data Mining (WSDM 2018)
    (Acceptance rate: 16%)

  • Real-Time Understanding of Humanitarian Crises via Targeted Information Retrieval
    Kien Pham, Prasanna Sattigeri, Amit Dhurandhar, Arpith Jacob, Maja Vukovic, Patrice Chataigner, Juliana Freire, Aleksandra Mojsilović, and Kush R. Varshney.
    IBM Journal of Research and Development (IBM Journal 2017)

  • Understanding website behavior based on user agent.
    Kien Pham, Aecio Santos, Juliana Freire.
    The 39th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2016)

  • Interactive Exploration for Domain Discovery on the Web
    Yamuna Krishnamurthy, Kien Pham, Aecio Santos, Juliana Freire.
    KDD Workshop on Interactive Data Exploration and Analytics (IDEA 2016)

  • Structured open urban data: understanding the landscape
    Luciano Barbosa, Kien Pham, Claudio Silva, Marcos R Vieira, Juliana Freire
    (Big Data Journal 2016)

  • The more the merrier: Efficient multi-source graph traversal
    Manuel Then, Moritz Kaufmann, Fernando Chirigati, Tuan-Anh Hoang-Vu, Kien Pham, Alfons Kemper, Thomas Neumann, Huy T Vo
    The 40th International Conference on Very Large Data Bases (VLDB 2014)

  • Leveraging Multilingual Content to Identify and Reconcile Inconsistencies in Wikipedia.
    Kien Pham, Fernando Chrigati, Luciano Barbosa, Juliana Freire.
    Technical Report for PhD Qualifying Exam, NYU School of Engineering 2014

  • New hybrid genetic algorithm for solving optimal communication spanning tree problem
    Kien Pham, Hiep Nguyen, Binh Huynh
    The 26th International Symposium on Applied Computing (SAC 2011)


  • Humanitarian Crisis Analysis Using Secondary Information Gathered by a Focused Web Crawler. Filed April 11, 2017
    Ioana Baldini, Amit Dhurandhar, Abhishek Kumar, Aleksandra Mojsilović, Kien Pham, Kush R. Varshney, and Maja Vukovic.


Last-modified: Oct 2017 · Jekyll academic template