Blog

Reference matching: for real this time

In my previous blog post, Matchmaker, matchmaker, make me a match, I compared four approaches for reference matching. The comparison was done using a dataset composed of automatically-generated reference strings. Now it’s time for the matching algorithms to face the real enemy: the unstructured reference strings deposited with Crossref by some members. Are the matching algorithms ready for this challenge? Which algorithm will prove worthy of becoming the guardian of the mighty citation network? Buckle up and enjoy our second matching battle!

Phew - its been quite a year

As the end of the year approaches it’s useful to look back and reflect on what we’ve achieved over the last 12 months—a lot! To be honest, there were some things we didn’t get done—or didn’t make as much progress with as we hoped—but that happens when you have an ambitious agenda. However, we also got some things done that we didn’t expect to or that weren’t even on our radar at the end of 2017—this is inevitable as the research and scholarly communications landscape is rapidly changing.

Newly approved membership terms will replace existing agreement

In its July 2018 meeting, the Crossref Board voted unanimously to approve and introduce a new set of membership terms. At the same meeting, the board also voted to change the description of membership eligibility in our Bylaws, officially broadening our remit beyond publishers, in line with current practice and positioning us for future growth.

Updates to our by-laws

Good governance is important and something that Crossref thinks about regularly so the board frequently discusses the topic, and this year even more so. At the November 2017 meeting there was a motion passed to create an ad-hoc Governance Committee to develop a set of governance-related questions/recommendations. The Committee has met regularly this year and the following questions are under deliberation regarding term limits, role of the Nominating Committee, implications of contested elections, and more.

Data Citation: what and how for publishers

We’ve mentioned why data citation is important to the research community. Now it’s time to roll up our sleeves and get into the ‘how’. This part is important, as citing data in a standard way helps those citations be recognised, tracked, and used in a host of different services.

Matchmaker, matchmaker, make me a match

Matching (or resolving) bibliographic references to target records in the collection is a crucial algorithm in the Crossref ecosystem. Automatic reference matching lets us discover citation relations in large document collections, calculate citation counts, H-indexes, impact factors, etc. At Crossref, we currently use a matching approach based on reference string parsing. Some time ago we realized there is a much simpler approach. And now it is finally battle time: which of the two approaches is better?

What does the sample say?

At Crossref Labs, we often come across interesting research questions and try to answer them by analyzing our data. Depending on the nature of the experiment, processing over 100M records might be time-consuming or even impossible. In those dark moments we turn to sampling and statistical tools. But what can we infer from only a sample of the data?

Why Data Citation matters to publishers and data repositories

A couple of weeks ago we shared with you that data citation is here, and that you can start doing data citation today. But why would you want to? There are always so many priorities, why should this be at the top of the list?

Ten more days ’til Toronto

Our LIVE Annual Meeting is back in North America for the first time since 2015, and with just 10 days to go, there’s a lot going on in preparation. As you’d expect with a How good is your metadata? theme—the two-days will be entirely devoted to the subject of metadata—because it touches everything we do, and everything that publishers, hosting platforms, funders, researchers, and librarians do. Oh, and it’s actually super awesome too—and occasionally fun.

Crossref LIVE Brazil evoked vibrant Q&A session

There has been a steady increase in the growth of our membership in Latin America—and in Brazil in particular—over the past few years. We currently have more than 800 Brazil-based members; some as individual members, but most are sponsored by another organization. As part of our LIVE Local program Chuck Koscher and I traveled to meet some of these members in Goiânia and Fortaleza, where we co-hosted events with Associação Brasileira de Editores Científicos do Brasil (ABEC Brasil)—one of our largest Sponsors.