october 1 niso training thursday: using alerting systems to ensure oa policy compliance

30
SHARE Phase II Judy Ruttenberg, Association of Research Libraries Erin Braswell, Center for Open Science Fabian von Feilitzsch, Center for Open Science NISO Virtual Conference: October 1, 2015 Using Alerting Systems to Ensure OA Policy Compliance

Upload: national-information-standards-organization-niso

Post on 08-Jan-2017

430 views

Category:

Education


1 download

TRANSCRIPT

Page 1: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

SHARE Phase IIJudy Ruttenberg, Association of Research Libraries

Erin Braswell, Center for Open ScienceFabian von Feilitzsch, Center for Open Science

NISO Virtual Conference: October 1, 2015Using Alerting Systems to Ensure OA Policy Compliance

Page 2: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Founded by Academic Leaders, Built with Open Technology

Research universities are long-lived and are mission-driven to generate, make accessible, and preserve over time new

knowledge and understanding.

Page 3: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

What is SHARE?

SHARE is building a free, open data set about research and scholarly activities across their life cycle.

Page 4: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Research Lifecycle

Page 5: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Open Science Frameworkhttp://osf.io

Page 6: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 7: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 8: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 9: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 10: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 11: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 12: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 13: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 14: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 15: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 16: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 17: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 18: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance
Page 19: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Using SHARE’s Search API● API is currently a slightly restricted Elasticsearch

instance● You can hit the API with any valid Elasticsearch query● Going to go over some quick and sort of interesting

aggregations that are available

Page 20: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

What it looks like: The query$ curl -XPOST "https://osf.io/api/v1/share/search/" -H 'content-type: application/json' -d '{

"query": {"match_all": {}},

"size": 0,

"aggs": {

"top tags": {

"terms": {

"field": "tags"

}

}

}

}'

Page 21: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

What it looks like: The response{ "count": 2137806, "time": 0.1, "results": [], "aggs": null, "aggregations": { "top tags": { "buckets": [ { "key": "ecological", "doc_count": 20347 }, { "key": "long", "doc_count": 20179 }, { "key": "term", "doc_count": 20021 }, { "key": "lter", "doc_count": 18862 }, { "key": "data", "doc_count": 17086 }, ....

...

{ "key": "research", "doc_count": 16539 }, { "key": "earth", "doc_count": 16395 }, { "key": "water", "doc_count": 16150 }, { "key": "program", "doc_count": 16098 }, { "key": "remote", "doc_count": 15963 } ], "sum_other_doc_count": 1049093, "doc_count_error_upper_bound": 3000 } }}

Page 22: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Kind of a pain

● We have an experimental python library to help cut down the verbosity a bit

Page 23: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Same Example>> from sharepa import ShareSearch>> from sharepa.analysis import bucket_to_dataframe

>> search = ShareSearch()>> search.aggs.bucket('top tags', 'significant_terms', field='tags')

Internal structure is:

{ "query": { "match_all": {} }, "aggs": { "top tags": { "significant_terms": { "field": "tags" } } }}

Page 24: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Now we send the JSON blob to the SHARE search API

>> results = search.execute()

And we get back the same response.

We can then use some of our utilities to convert

the Elasticsearch response to a dataframe

(basically just a table)

>> df = bucket_to_dataframe( 'top tags', results.aggregations['top tags']['buckets'] ).sort('top tags', ascending=False)

and plot it as well:

>> df.plot(kind='bar', x='key', y=['bg_count', 'top tags'])

Page 25: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Cancer

Page 26: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Flu

Page 27: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Influenza

Page 28: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Vaccine

Page 29: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Phase II

Page 30: October 1 NISO Training Thursday: Using Alerting Systems to Ensure OA Policy Compliance

Contact [email protected]

https://osf.io/yg3xj/