Aggregating multiple sources to pre-populate new repositories


Upload: bianchini-laurence

Post on 27-Jul-2015


Category:

Technology



TRANSCRIPT

DIGITAL PLATFORMS FOR THE DISSEMINATION OF RESEARCH

AGGREGATING MULTIPLE SOURCES TO PRE-POPULATE NEW REPOSITORIES

The MyScienceWork.com Internal Database
A GLOBAL SCIENTIFIC PLATFORM gathering 30+ million research publications from:
• open access sources
• MSW’s own repositories: POLARIS
• Publishers (both OA and not)
• User uploads
• Full-text mining
• DOI management service
• Journal services

AGGREGATION SERVICE
Data from different sources are given a confidence rate. Depending on the confidence rate assigned to them, data either go through validation processes or overwrite other data.

High-confidence sources of data
There is a long list of services that one can check for:
• the identity of the authors/institutions
• the metadata
• the copyright
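The overwrite rule described above can be sketched as follows. This is a minimal illustration, not MyScienceWork's implementation: the `FieldValue` type, the `merge_records` function, the source names, and the 0.0 to 1.0 confidence scale are all assumptions made for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class FieldValue:
    value: str
    source: str        # hypothetical source label, e.g. "publisher"
    confidence: float  # assumed scale: 0.0 (low) to 1.0 (high)


def merge_records(candidates):
    """For each metadata field, keep the value with the highest confidence.

    A value from a higher-confidence source overwrites one from a
    lower-confidence source; on a tie, the first value seen is kept.
    """
    merged = {}
    for record in candidates:
        for field_name, candidate in record.items():
            current = merged.get(field_name)
            if current is None or candidate.confidence > current.confidence:
                merged[field_name] = candidate
    return merged


# Two hypothetical descriptions of the same publication.
from_upload = {
    "title": FieldValue("aggregating multiple sources (draft)", "user_upload", 0.5),
    "abstract": FieldValue("How to prefill a repository.", "user_upload", 0.5),
}
from_publisher = {
    "title": FieldValue("Aggregating Multiple Sources", "publisher", 0.9),
    "year": FieldValue("2015", "publisher", 0.9),
}

merged = merge_records([from_upload, from_publisher])
```

Here the publisher's title replaces the uploaded one, while the abstract, which only the upload provides, is kept. A real service would presumably send low-confidence values through a validation step instead of accepting them directly.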

Here we present the process to pre-populate a new repository in the context of MyScienceWork’s POLARIS platforms.

To centralize and structure metadata, this aggregation service uses:
• an internal database,
• uploads from users and institutions,
• several external services

The confidence rate assignment process covers both metadata from different sources and author-publication matching.

WHY?
Aggregating structured data from several external, open services to prefill repositories demonstrates how to use existing databases instead of asking librarians and researchers to deposit all their articles.

PRE-POPULATION OF REPOSITORIES
Challenges:
- Matching between publications/authors/institutions
- Date: only publications produced while the authors were at the institution
- etc.
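The date challenge listed above can be illustrated with a short sketch. It assumes that an author's affiliation periods are known and that the publication date is an acceptable proxy for when the work was produced; the `Affiliation` and `Publication` types and the `produced_at` function are hypothetical names introduced for this example.

```python
from dataclasses import dataclass
from datetime import date
from typing import Optional


@dataclass(frozen=True)
class Affiliation:
    institution: str
    start: date
    end: Optional[date] = None  # None means the affiliation is ongoing


@dataclass(frozen=True)
class Publication:
    title: str
    published: date


def produced_at(publication, affiliations, institution):
    """Return True if the publication date falls within one of the
    author's affiliation periods at the given institution."""
    for affiliation in affiliations:
        if affiliation.institution != institution:
            continue
        started = affiliation.start <= publication.published
        not_ended = affiliation.end is None or publication.published <= affiliation.end
        if started and not_ended:
            return True
    return False


# Hypothetical author history and publications.
history = [
    Affiliation("University A", date(2008, 9, 1), date(2012, 8, 31)),
    Affiliation("University B", date(2012, 9, 1)),
]
early = Publication("Thesis chapter", date(2011, 5, 10))
recent = Publication("Follow-up study", date(2014, 3, 2))

for_university_b = [p for p in (early, recent) if produced_at(p, history, "University B")]
```

Only the 2014 publication would be prefilled into University B's repository in this example. Publication lag means a strict date cutoff can misattribute work, which is one reason such matches would plausibly receive a confidence rate and not be accepted outright.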

ADD-ON TO EXISTING REPOSITORIES
This aggregation service can be added on to prefill any existing institutional repository.

BIG DATA
We aggregate data from our large community of MSW and POLARIS users.


Let’s go find your publications!

Learn more about Polaris

FOR WHOM?
POLARIS by MyScienceWork is a tailor-made solution for institutions. It offers turnkey platforms for enhanced dissemination and communication of research.

L. Bianchini, D. Vannson, J. Houssière, V. Simon – MyScienceWork - [email protected]