CERN Accelerating science

News, announcements and future developments of CDS services at CERN

Category: CDS-RDM

A new chapter for CDS: the CERN Repository

Sticky post

The last weeks of 2024 marked a big step for the CDS team and all CDS users: the first migration of CDS content to a new version of the platform—we have just completed the migration of the first collection—Summer Student Programme reports. We can now consider the CERN repository a production service.

This moment is historic not only for our service, but also for CERN.

A bit of history…

CDS was established over 20 years ago as the institutional repository of CERN, tasked with archiving, preserving, and disseminating the organization’s research output, administrative documents, and multimedia. It is powered by the Invenio framework version 1, which has also been adopted by numerous repositories worldwide.

While CDS has dutifully served the CERN community for many years, it has become evident that it requires a refresh in terms of user experience and available features to meet the demands of 2025 and future.

Drawing from the successful experience gained with Zenodo, we have developed InvenioRDM: a versatile digital repository designed for researchers. InvenioRDM has been created collaboratively with over 20 partners around the globe, and incorporates all the best FAIR (Findable, Accessible, Interoperable, and Reusable) practices, offering a modern user experience.

The CERN repository: a new CDS

Screenshot of the new CDS website

The CDS platform, currently accessible at https://repository.cern, serves as a new version of previous CDS website and is built on InvenioRDM.

Over the forthcoming months, our focus will be on tailoring it to suit the requirements of an Institutional Repository while integrating features specific to CERN. In parallel, we will migrate the content of each collection, one by one, until the end.

How?

We have developed a migration plan spanning 2024 and 2025, with a subsequent phase planned for 2026. This long and complex migration, set to unfold over several years, will be guided by three core principles:

  • Engage: before initiating the migration of any collaboration using CDS, we will actively engage with its members. Our goal is to ensure a smooth transition that meets the needs of each community without disrupting their work. We will thoroughly analyze use cases and collaboratively establish timelines.
  • Simplify: we aim to make submitting content easier and more intuitive. By empowering users with tools to independently organize and curate their materials, we will enhance the overall user experience.
  • Standardize: we will adopt standardized content metadata practices to align with FAIR principles and Open Science best practices

Next steps

When will my content be migrated? Where should I upload new documents? Who should I contact?
Don’t worry—these are questions we plan to answer together with you. We are committed to working closely with all content owners in CDS, gradually engaging with each group to share our plans and shape the future collaboratively.

Following the successful migration of the Summer Student reports, we’ve validated our migration processes and pipelines. Building on this success, we are now ready to tackle more complex challenges, with the next milestone being the migration of CERN Theses.

In parallel, we aim to explore the feasibility of bulk migrating content from small and medium experiments. Additionally, we plan to prototype a new review and comment workflow to address the needs of most users.

Keeping You in the Loop

This migration is an ongoing learning process, and we haven’t figured everything out yet! Your support and feedback are crucial to our success.

We’ve recently started documenting our migration journey and compiling information on a dedicated website, which is continuously evolving. Additionally, we’ll keep sharing updates and news right here.

If you have questions or concerns, don’t hesitate to reach out—we’re here to help.

Improved DOI, ORCiD, ROR and CERN integrations

Our new CDS repository is now more FAIR and more integrated with CERN services. Read below how.

DOI – do I need one?

A DOI (Digital Object Identifier) is like a permanent address for something on the internet, such as a research paper, article, or video. It’s a unique code that never changes, even if the location of the content moves to a new website.

Think of it like a tracking number for a package: no matter where the package goes, you can always find it using the number. Similarly, a DOI helps you find a document online, even if its web link breaks or changes.

For example, a DOI might look like this:
https://doi.org/10.17181/dd19c-hwf65

When you click on it, it will always take you to the right place to access the content. This makes DOIs very useful for researchers, students, and anyone trying to find, share or cite information without worrying about broken links.

You do need, and you should prefer a DOI instead of other IDs, if:

  • the page of your document is public (the file might be restricted)
  • you want to keep permanent access to your article/paper/resource
  • you want any other researcher to be able to make references to your content (citations)
  • you want to share the content with the research community without worries about broken links

You do not need a DOI if:

  • your document is internal, fully restricted to to your collaboration
  • you do not need anyone to be able to cite your work/document
  • your document does not have research related nature

Choose your DOI option

Latest CDS upgrade, we introduced optional DOIs. When you are uploading a document, you can choose to:

  1. Input an external DOI (for example, already provided by Zenodo.org, ArXiV.org or by a journal).
  2. Obtain a new DOI, registered by CDS.
  3. Avoid the creation of any DOI.
DOI selection options
Obtaining a DOI

Find my colleagues

Since we are on the subject of submitting the content to https://repository.cern (aka CDS), how about providing the most accurate metadata possible, without hustle?

You can now easily find all of your CERN colleagues and external researchers with ORCiD profile when listing them as authors or contributors to your work.

But wait, what is ORCiD?

An ORCiD (Open Researcher and Contributor ID) is like a unique ID card for researchers (similar to DOI for research resources). It’s a special number that helps identify scientists, academics, and other contributors to research, so their work can always be linked back to them, no matter where it’s published.

Think of it like a social security number, but for researchers—it’s unique to each person and stays with them throughout their career. This is especially useful because many researchers might have similar names, or their names might appear differently in different publications.

For example:
An ORCID might look like this: 0000-0001-2345-6789

With this number, all of a researcher’s work—papers, data, and other contributions—can be easily found and correctly attributed to them. It helps avoid confusion and makes it simple to track their achievements over time.

Creators and contributors

We have automated the import of the ORCID database, which, at the time of writing, contains over 21 million ORCID records, and the import of CERN users with an account.

Where can you add your colleagues to attribute for their contributions? There are dedicated fields in the deposit form:

Creators field
Contributors field

In the latest CDS platform upgrade, we improved the search capabilities and display of the autocompletion widget. You can now find your colleagues easier and add them even faster.

Preview of authors search

We highly recommend creating your own ORCID profile using this link. It is a must-have for any researcher!

Once you obtain and ORCID, you should connect it with your CERN Profile, using the Users Portal, as shown:

Finding affiliations

It is in a researcher’s best interest to provide the all possible metadata describing their research – more information they provide, higher the chance for other researchers to find about it.

Information about the researchers’ affiliations might be particularly important for highlighting the relationship with the organization which funded the research. Therefore, providing the up-to-date information about your affiliation will work well.

In CDS, we usually prefill that field for you, but our sources might not have the latest data, so you are able to fill the affiliations manually if needed. How? By using ROR identifiers (yes, yet another identifier!) or simply searching for the organization’s name.

A ROR identifier (Research Organization Registry identifier) is a unique, permanent ID used to identify research organizations worldwide. It ensures that institutions like universities, labs, and funding bodies are consistently and accurately recognized in research metadata, publications, and data repositories.

For example, many institutions have similar or changing names, which can create confusion. A ROR identifier solves this by providing a standardized, unchanging ID, like https://ror.org/01ggx4157 for CERN, which always points to the same organization.

ROR is open, free, and interoperable with other systems like DOIs (for publications) and ORCID (for researchers), helping improve the discoverability and tracking of research outputs. It’s a key tool for making research more connected and organized globally.

Affiliation field

Rich metadata

We made sure to put in place all these integrations with ORCiD, DOI, ROR and the CERN databases in order to make it easy for you to fill out the most metadata automatically (and quickly!). We strive for seamless user experience, and we really hope that all these features will make your upload easier and… more complete!

Find most viewed and downloaded

Curious to know what are the most viewed or most downloaded records? You can now easily find it out by sorting searching results.

🎄☃️❄️ Best wishes and a happy new year!

Image generated by ChatGPT with DALL·E.

The CDS team wishes you a joyful holiday season and a bright, successful start to the New Year!

Updates on the new CDS

We’re excited to announce a set of new features and improvements to the new CERN Document Server (CDS), now running on InvenioRDM v12! Our focus has been on making CDS more user-friendly and efficient for everyone.

The long list of all changes is detailed in the full changelog. Below, the main highlights more relevant to the CERN community.

What’s new?

Enhanced Search Accuracy

We’ve improved the search functionality to make finding records easier and more precise. Whether you’re looking for specific datasets or publications, the search engine is now optimized to deliver more accurate results, helping you locate the content you need faster.

Mathematical Formula Rendering

For researchers working with complex formulas, CDS now nicely renders LaTeX formulas within search results and records.

Content Policy and Terms of Use

To ensure transparency and safeguard users, we have added a Content Policy and Terms of Use to the platform. These documents clarify the rules regarding content submission and usage on CDS, helping maintain a safe and collaborative environment.

Introduction of sub-communities

One of the biggest updates is the feature of sub-communities. Communities on CDS can now be nested, meaning a community can have a parent community. This change brings more flexibility in organizing and structuring data, catering to the diverse needs of departments and research groups.

User Interface Tweaks

We’ve implemented a series of UI improvements, including fixing issues with community logos and adding enhanced loading icons during login and logout among other fixes. These small but impactful updates make the interface more user-friendly and visually coherent.

Migration of the 1st collection

Under the hood, we are working hard to migrate the very first collection of documents from the current CDS repository to our new platform. We are now in the process of testing the migration of documents and files, and ensuring the correct redirection of web links.

This is a very important milestone: it will prove that the migration processes that we have put in place are working as expected, and it will unblock the migration of the next collection of documents.

What’s next?

Our team is already working on new features for future releases:

Collections

We are developing a way to categorize records easily within a community (or independently) based on metadata. This allows users to organize and navigate records more intuitively.

Automatic Ingestion of ORCID and ROR Values

To save time and streamline workflows, we are working on automating the ingestion of ORCID and ROR data into the system, ensuring that author and organization identifiers are up-to-date without manual input.

Integration with CERN users database

We are working on making CERN users findable when searching for authors or collaborators during an upload.

Stay tuned for more updates, and as always, feel free to share your feedback with us!

The new Zenodo is live!

In the entire 2023, until October, our team worked in closed collaboration with the Zenodo team to launch the new version, now based on InvenioRDM, the turn-key research data management repository platform.

You can read more about this very important milestone in the official blog post and the OpenAire blog.

The future CDS

This is also a fundamental step for the future version of CDS, which is also based on InvenioRDM. Thanks to this new Zenodo launch, InvenioRDM is now a battle-tested platform, and it will receive constant improvements to make sure that it fulfils the needs of researchers worldwide.

We have learned a ton preparing the new version of Zenodo, not only developing features, but also preparing the infrastructure. With all these lessons-learned, the new CDS will be a more reliable and performant platform.

Next steps

We will work until the end of this year 2023 to analyze the features available today in CDS, and identify the ones that are essential to migrate to the new version.

We are working on a detailed migration plan, and we will get in contact with the main communities to better understand their needs and ensure a smooth transition from the current CDS to the new one, in 2024.

We are very excited, and we are looking forward to seeing the new CDS being used at CERN!

Updates on the new CDS

Summer has already started 😎 and, in the previous months, we have worked hard to integrate the latest development in the new CDS platform.

The result looks beautiful!

The new CDS platform is the brand-new version of the current CERN institutional repository, a modern and easy-to-use website where CERN users can archive and share their research, multimedia content or departmental documents.

You can now preview and try out the latest features in our test instance https://sandbox-cds-rdm.web.cern.ch (reachable from inside CERN campus). Just to mention a few, we have integrated users and groups CERN databases; newly uploaded publications will now have a DOI out-of-the box, ready to be shared and cited; files are securely stored in EOS file system. And there is much more.

The “Browse” section contains links to collections and categories to the former CDS platform: we will slowly migrate data to this brand-new CDS.

The footer of the new CDS website contains useful links to make sure that you will find the information that you need.

The production instance https://new-cds.cern.ch will be soon start to be used by some selected communities at CERN, and we will gather feedback to continuously improve it and make it as easy as possible to use.

After summer, more features will be coming 🚀: we will make it very easy to restrict and share documents with other users, and we will work on the administration panel to fully manage records and users in the system.

This version is just the base for the future CDS. More features will be needed to support all current use cases. To that end, we will be contacting and working together all main users so that we can define together the plan for completion of this future Institutional Repository.

If you wish to, open the new CDS website, login, try it out and share feedback with us!

The new CDS, based on InvenioRDM

With the LTS release (v9) and the latest release (v10), InvenioRDM has reached the maturity needed for production-ready digital repository websites. InvenioRDM is a generic data management repository, developed by our team in collaboration with many partners all over the world. Free to use and open-source.

The InvenioRDM demo website.

As already done by several partners (e.g. Caltech University, TU Graz University, TU Wien University), our team worked hard to create a preview version of the future CDS, available at https://sandbox-cds-rdm.web.cern.ch.

The new CDS website, based on InvenioRDM.

As first milestone, we have created and deployed the new instance of CDS and also migrated a selected set of records, metadata-only. This initial setup will allow us to iterate with the process of data migration, expanding incrementally the number of records and improving the data quality.

In the first quarter of this year, we will continue working on the InvenioRDM product, adding more features and integrating them in the new CDS website.

We will also start an analysis of the feature-set available in the current CDS, but still missing in the new platform: thanks to this, we will be able to come up with a plan for the next steps.

We are very excited to finally see the new CDS taking shape! Stay tuned for future announcements!

Powered by WordPress & Theme by Anders Norén