Hello Everyone,

The Smithsonian Libraries seeks a computer science or MLS student for the Taxonomic Literature 2 Linked Data Mining internship. This is a paid internship, carrying a stipend of $500 per week (full time) or a total of $1500 (part time), to take place in January/February of 2013. It may be performed in person at the National Museum of Natural History in Washington, D.C., or remotely. Applications will be accepted until October 15th, 2012. Further project details are below or at http://library.si.edu/internships/taxonomic-literature-2-linked-data-mining-paid-internship.

Dates preferred: Winter term (January-February) 2013

Full time or Part time: Either full time for three weeks or part time, totaling 105 hours. This is a paid internship, carrying a stipend of $500 per week (full time) or a total of $1500 (part time).

Intern Supervisor: Joel Richard

Location of internship: Remote or Local (Washington, DC)

Desired knowledge/skill sets:

One of: B.S. in Computer Science or related field OR MLS/MLIS, current student or recent graduate (within 6 months)

Must have: Experience with databases or large datasets; knowledge of at least one programming language (e.g., Ruby, Python, Perl)

Desirable: experience or education in the Natural Sciences


Brief description of project:

TL-2 is the premier publication of the International Association for Plant Taxonomy (IAPT): a 15-volume guide to the literature of systematic botany published between 1753 and 1940. It is organized by author and includes numbered entries for each author's publications. Suggested abbreviations for use in taxonomic publications are provided: abbreviations for the author's name, short titles, and abbreviations of the short titles for publications. TL-2 is the standard by which authors' names and titles should be abbreviated.

TL-2 is now offered online as a searchable database at http://www.sil.si.edu/digitalcollections/tl-2/. The plan is to provide TL-2 as linked open data (LOD) to increase its utility for the botany community.

Possible activities to explore in this internship include one or more of:

End results of the internship will be at least two of:

If the internship is remote, frequent check-ins via Skype or GTalk (or phone) will be the primary means of communication. The internship can be full-time or part-time (20 hrs/week or more) with total time spent on the project not to exceed 105 hours.

Please apply via SOLAA (https://solaa.si.edu/solaa/SOLAAHome.html). Select "Smithsonian Institution Libraries" as placement unit, then "Smithsonian Institution Libraries Internship Program" as program and "Taxonomic Literature 2 Linked Data Mining" as specific project. Paper and email applications will not be accepted.



Erin Clements Rushing

Digital Images Librarian
Digital Services Division, Smithsonian Institution Libraries
Room 2206  MRC 154
10th Street and Constitution Ave, NW
Washington, DC 20013-7012
p. 202.633.1708
f. 202.633.4313