Skip to main content

Data curation.

In the digital age, the process of data curation is crucial, as it guarantees the accuracy, completeness, and trustworthiness of information across diverse databases.

OpenOrgs streamlines this process by identifying and resolving potential data duplicates, thus enhancing the integrity of the data related specifically to organisations.

This allows data curators to manually refine the information, ensuring that organisational records are precise and clear.

OpenAIRE is a non-profit organization with a mission to promote open scholarship and improve discoverability, accessibility, shareability, reusability, reproducibility, and monitoring of data-driven research results, globally. The OpenAIRE Graph is a key element of this initiative, serving as a comprehensive collection of metadata related to research outputs, projects, organisations, and researchers. The curation work in OpenOrgs plays a significant role in enhancing the functionality and accuracy of the Graph, contributing in several ways:

Data quality and accuracy

OpenOrgs ensures the data about research organisations is accurate and up-to-date, enhancing the reliability of the OpenAIRE Graph.

Enhanced discoverability

The disambiguation of information about research entities improves the discoverability of related research outputs, projects, and datasets.

Interlinking of Research Outputs

Accurate linking of research outputs to organizations helps in tracking contributions and collaborations across the research landscape.

Compliance and Reporting

Curation aids organizations in meeting open access mandates and reporting requirements by accurately representing their research outputs.

Facilitating Research Assessments and Metrics

Reliable organizational data contributes to more accurate research performance metrics and assessments.

Supporting the Open Science Movement

By improving data quality and accessibility, OpenOrgs curation supports transparency, reproducibility, and accessibility in research.

How can you help?

roles and responsabilities of data curators
  • All Data Curators, can assist by:

    • Enriching and updating metadata for organisations.
    • Identifying and resolving duplicates, both algorithm-suggested and manually identified.
    • Approving organisations and providing them with stable OpenOrgs IDs.
    • Adding new organisations to the OpenOrgs database.
    • Documenting your insights in the notes section for each organisation.
    • Reviewing and understanding the history of modifications on organisation pages.
  • As a Simple User, your contributions are valuable through:

    • Reviewing and deciding on potential duplicate organisations as indicated by the algorithm.
    • Optionally enhancing metadata while resolving duplicates.
    • Concentrating your efforts on a specific set of countries.
    • Editing and enriching the metadata of organisations.
    • Initiating the creation of new organisation entries, subject to administrative approval.
    • Participating in the approval process for suggestions from the duplicate identification algorithm.
    • As a National Administrator (NOAD), you play a crucial role by:

      • Giving final approval to new organisations, thereby granting them official OpenOrgs IDs.
      • Managing user access and permissions within your nation.
      • Taking charge in resolving any conflicts that occur.
      • Making decisions on pending organisations to finalize their approval status.

    Curation methodology.

    OpenOrgs curation methodology integrates advanced algorithms with expert human oversight,
    ensuring high-precision identification and management of duplicates, along with comprehensive
    metadata enhancement for accurate and detailed organisational records

    01

    Automatic Duplicate Suggestion

    The algorithm periodically suggests potential duplicates for data curators to review. While the algorithm detects similarities, the final decision on whether different digital representations represent the same object requires human precision.

    02

    Manual Duplicate Management

    Beyond automatic detection, data curators manually address ambiguities in OpenAIRE's aggregated organisational data. They follow established procedures to resolve identified duplicates, playing a crucial role in improving data accuracy and quality.

    03

    Metadata Curation

    Within OpenOrgs, data curators have the ability to enhance the metadata description of organisational entities.

    This process addresses information gaps from original sources, significantly boosting the discoverability and completeness of organisational records.

    Official Name
    & Type

    Ensuring the organisation's official name is accurately listed and categorizing the type of organisation (e.g., university, research institute, non-profit).

    Geographical
    Location

    Specifying the organisation's precise geographical location, including city and country, to facilitate regional and global discoverability and collaboration.

    Acronyms
    & Aliases

    Recording official and commonly used acronyms to aid in recognition and searchability.
    Including other names by which the organisation is known, enhancing the ability to find the organisation under different search terms.

    Identifiers

    Incorporating unique identifiers from relevant databases enhances precise and clear identification across various platforms.

    URLs

    Providing official website URLs, contributing to direct accessibility and verification of the organisation's online presence.

    Relations

    Defining hierarchical relationships within larger organisations, such as departments, institutes, and sub-units, clarifying the structure and enhancing the understanding of the organisation's composition and scope.