Automatically retrieve massive datasets from Scopus

Universities are continuing to develop internal and external evaluation systems. These are aimed to improve the quality and performance of teaching and research.
Reference is often made to dedicated search engines such as Scopus or ISI WebOfScience.

The Mathematica code presented below is aimed to retrieve massive datasets from these search engines; indeed scholars are often ranked relatively to their colleagues in the same scientific area. Hence, to compute his own ranking, one must firstly retrieve the data pertinent to his area.


DigitAfter[page_,target_]:=Module[{tpos=Last[First[StringPosition[page, target]]]},
First[ToExpression[StringCases[StringTake[page, {tpos + 1, tpos + 6}], DigitCharacter..]]]];

authors = {{"7006812927", "Gurtin"}, {"7201905935", "Batra"}};

tab = Table[ipage=Import[StringJoin["http://www.scopus.com/authid/detail.url?authorId=",authors[[i,1]]]];
doc = DigitAfter[ipage, "This author has published"];
cit = DigitAfter[ipage, "This author has been cited by "];
hi = DigitAfter[ipage, "h Index"];
{authors[[i, 2]], doc, cit, Round[10 cit/doc]/10., hi}, {i, Length[authors]}];
TableForm[SortBy[tab, -Part[#, -2] &],
TableHeadings -> {None, {"Author", "Docs", "Cits", "AvCit", "h-Index"}}]

The example which involve only two scholars (namely Gurtin ME and Batra RC) gives you:

Of course you can run the code with a list of as many authors you like once you retrieve their authorid‘s from Scopus.

About these ads

~ by svidoli on December 1, 2011.

Leave a Reply

Fill in your details below or click an icon to log in:

WordPress.com Logo

You are commenting using your WordPress.com account. Log Out / Change )

Twitter picture

You are commenting using your Twitter account. Log Out / Change )

Facebook photo

You are commenting using your Facebook account. Log Out / Change )

Connecting to %s

 
Follow

Get every new post delivered to your Inbox.

%d bloggers like this: