[Corpora-List] Summer Intern/Large Scale Graph Mining

Ang Sun asun at cs.nyu.edu
Sat Feb 9 17:29:00 CET 2013

Description http://ch.tbe.taleo.net/CH06/ats/careers/requisition.jsp?org=INTELIUSCORP&cws=39&rid=231

inome is gathering the world’s information and making it people-centric. The inome graph connects billions of entities (people, organizations and addresses) to encode the information-genome of each individual. inome Research develops cutting-edge systems to extract, standardize, link and create intelligence to power inome’s industry-leading people search engine and platform development environment. inome Research is a team of scientists with extensive expertise in Record Linkage, Natural Language Processing, Entity Resolution, Data Deduplication, Machine Learning and Information Retrieval. The internship will be at our headquarters in Bellevue, WA, and offers competitive compensation.


This position will explore large-scale graph algorithms for problems at the intersection of people search and data record linkage, using billions of person records derived from sources ranging from public social network profiles to phone books. You will design advanced algorithms, implement them to run on a large Hadoop cluster, and monitor the quality of inome’s person matching system. You will also design and develop tools on top of the inome entity graph by exploring the entities and their connections. Sample projects include recommendation systems, community detection, and finding influential people. This is likely to be innovative work, and we expect the summer intern to improve or extend our products and publish a paper at a top conference.

Required Skills:

- Pursuing a Ph.D. in Computer Science with a focus on large-scale graphs, graph mining, social media analytics, data mining, machine learning, natural language processing or related fields
- Experience with large-scale graph algorithms, clustering, PageRank, and community detection
- Strong hands-on skills in object-oriented design methodology and application development in Java
- Proficiency in at least one of Perl, PHP, Python
- Basic understanding of Hadoop and/or MapReduce
- Familiarity with the backend architecture and development of large-scale distributed systems
- Familiarity with graph-based machine learning toolkits such as GraphLab
- Familiarity with graph databases, preferably with hands-on experience in Neo4j or InfiniteGraph
- Familiarity with graph query languages

Desired Skills:

- Interest in solving problems with big data collected from various sources
- Excellent understanding of computer science fundamentals, data structures, and algorithms
- Excellent problem-solving skills
- Past publications on graph algorithms, social networks, or related fields
- Familiarity with Amazon Mechanical Turk or other human evaluation systems

Contact: Please apply online at http://ch.tbe.taleo.net/CH06/ats/careers/apply.jsp?org=INTELIUSCORP&cws=39 or send your CV to Ang Sun, asun at inome.com
