<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Aptos;}
@font-face
{font-family:Consolas;
panose-1:2 11 6 9 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:12.0pt;
font-family:"Aptos",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
pre
{mso-style-priority:99;
mso-style-link:"HTML Preformatted Char";
margin:0cm;
font-size:10.0pt;
font-family:"Courier New";}
span.HTMLPreformattedChar
{mso-style-name:"HTML Preformatted Char";
mso-style-priority:99;
mso-style-link:"HTML Preformatted";
font-family:"Consolas",serif;
mso-fareast-language:EN-GB;}
span.gmailsignatureprefix
{mso-style-name:gmail_signature_prefix;}
span.EmailStyle22
{mso-style-type:personal-reply;
font-family:"Aptos",sans-serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:11.0pt;
mso-ligatures:none;
mso-fareast-language:EN-US;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="EN-GB" link="blue" vlink="purple" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">Hi Saulius & Antanas<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">I have prepared a .tsv file which hopefully will allow you to correct some IUCrData entries. These entries are missing the article ID/page number in the COD.
<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">I have just realized I haven’t added the first-line key (the header row) to the .tsv; it uses the same scheme as last time. My understanding is that the publisher, in this case, has added an
Article ID field to their CrossRef data and left the start and end page numbers blank.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">It looks like most entries have Article ID in the start page field. But that scheme seems to have broken down recently. The publishers have probably made format changes which have
upset your workflow system!<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">Regards<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US">Bob<o:p></o:p></span></p>
<p class="MsoNormal"><span style="font-size:11.0pt;mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span lang="EN-US" style="font-size:11.0pt;font-family:&quot;Calibri&quot;,sans-serif">From:</span></b><span lang="EN-US" style="font-size:11.0pt;font-family:&quot;Calibri&quot;,sans-serif"> Antanas Vaitkus &lt;antanas.vaitkus90@gmail.com&gt;
<br>
<b>Sent:</b> 03 July 2025 12:23<br>
<b>To:</b> Saulius Gražulis &lt;grazulis@ibt.lt&gt;<br>
<b>Cc:</b> McMeeking, Robert (STFC,DL,SC) &lt;robert.mcmeeking@stfc.ac.uk&gt;; cod-bugs@ibt.lt<br>
<b>Subject:</b> Re: [Cod-bugs] Missing Journal Information<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<div>
<p class="MsoNormal" style="margin-bottom:12.0pt">Hello,<o:p></o:p></p>
</div>
<p class="MsoNormal">just a short note on article IDs in CIFs. Since version 3.2.0 the coreCIF dictionary [1]<o:p></o:p></p>
</div>
<p class="MsoNormal">defines the _journal.paper_number data item which is more or less a paper ID:<o:p></o:p></p>
<pre> Article number that is used by some journals instead of a page range.<o:p></o:p></pre>
<pre> Usually applies to electronic-only journals.<o:p></o:p></pre>
<div>
<p class="MsoNormal">However, we do not yet properly support it in the COD deposition pipeline.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">[1] <a href="https://www.iucr.org/__data/iucr/cif/dictionaries/cif_core_3.2.0.dic">
https://www.iucr.org/__data/iucr/cif/dictionaries/cif_core_3.2.0.dic</a><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">Sincerely<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Antanas<o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal">On Thu, 3 Jul 2025 at 14:16, Saulius Gražulis &lt;<a href="mailto:grazulis@ibt.lt">grazulis@ibt.lt</a>&gt; wrote:<o:p></o:p></p>
</div>
<blockquote style="border:none;border-left:solid #CCCCCC 1.0pt;padding:0cm 0cm 0cm 6.0pt;margin-left:4.8pt;margin-right:0cm">
<div>
<div>
<p class="MsoNormal">Hi, Robert,<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">thanks for your answer!<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">On 2025-07-03 14:00, Robert McMeeking - STFC UKRI wrote:<o:p></o:p></p>
</div>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt">Thank you very much for the feedback. Getting my brain working again after being on a long holiday was a bit of a struggle! I am, however, now in
a better position to streamline my processes at this and also.</span><o:p></o:p></p>
</blockquote>
<p class="MsoNormal">Sounds great!<br>
<br>
<o:p></o:p></p>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt">I have checked a number of the entries on the COD site and they now look fine.
</span><o:p></o:p></p>
</blockquote>
<p class="MsoNormal">Great, ACK.<br>
<br>
<o:p></o:p></p>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt">I assume you are happy to map the Article ID onto the Start Page where this is appropriate. In the longer term perhaps “Article ID” might be a new
field?</span><o:p></o:p></p>
</blockquote>
<p>AFAIK (maybe my colleagues will chime in and correct me if I am wrong), there is no official IUCr data item for the "Article ID". Publishers have not settled on a convention either – some (re)use the first page as the article ID (RSC), some ditch page numbers altogether and
add the article identifier, sometimes without even specifying what properties it has (e.g. whether it will change for new revisions of the paper).<o:p></o:p></p>
<p>For our purposes, the first page serves as a good unique synthetic key together with the other bibliographic info (Journal+year+volume+issue+first-page is (nearly) guaranteed to be unique, unless publishers do something totally crazy such as having duplicated
page numbers in the same issue...). Thus, using the Article ID as the first page when there is no page number is suitable for this purpose, and this is what we have done so far. A machine-readable indication that we have an Article ID and not a page number is the NULL value
in the `lastpage` SQL column :).<o:p></o:p></p>
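<p>A minimal sketch of the convention described above, in Python with an in-memory SQLite table. Only the `lastpage` column name comes from this thread; the table name `entries`, the other column names, the helper `first_page_kind` and the sample rows are hypothetical and are not the actual COD schema.<o:p></o:p></p>
<pre>
```python
import sqlite3

# Hypothetical, simplified table: only the `lastpage` column name is taken
# from the message above; every other name here is an assumption.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE entries ("
    " journal TEXT, year INTEGER, volume TEXT, issue TEXT,"
    " firstpage TEXT, lastpage TEXT)"
)

# Made-up sample rows, for illustration only.
conn.executemany(
    "INSERT INTO entries VALUES (?, ?, ?, ?, ?, ?)",
    [
        ("Journal A", 2020, "10", "1", "100", "108"),  # ordinary page range
        ("Journal B", 2024, "9", "3", "A123", None),   # Article ID, no pages
    ],
)


def first_page_kind(lastpage):
    """Say what `firstpage` holds, using the NULL-lastpage convention."""
    return "article_id" if lastpage is None else "page"


rows = conn.execute(
    "SELECT journal, year, volume, issue, firstpage, lastpage FROM entries"
).fetchall()

# Synthetic key: journal + year + volume + issue + first page.
keys = {row[:5]: first_page_kind(row[5]) for row in rows}
```
</pre>
<p>Here `keys` maps each synthetic key to either `page` or `article_id`. The sketch assumes that a NULL `lastpage` never occurs for an entry that has real page numbers.<o:p></o:p></p>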
<p>For article identification we can increasingly rely on DOI nowadays.<o:p></o:p></p>
<blockquote style="margin-top:5.0pt;margin-bottom:5.0pt">
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt">The CrystalWorks link:</span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt"><a href="https://cds.dl.ac.uk/cgi-bin/rfm/crystalworks_trawl_new?8799_8892_9714" target="_blank">https://cds.dl.ac.uk/cgi-bin/rfm/crystalworks_trawl_new?8799_8892_9714</a></span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt"> </span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-size:11.0pt">does not pick up the amended details as yet, but I am assuming they will become available after the next overnight upload to our server – if not
tomorrow, then quite soon. I will get back to you if there are any issues to report.</span><o:p></o:p></p>
</blockquote>
<p>Let's see how it updates!<o:p></o:p></p>
<p>Regards,<br>
Saulius<o:p></o:p></p>
<pre>-- <o:p></o:p></pre>
<pre>Dr. Saulius Gražulis<o:p></o:p></pre>
<pre>Vilnius University Institute of Biotechnology, Saulėtekio al. 7<o:p></o:p></pre>
<pre>LT-10257 Vilnius, Lietuva (Lithuania)<o:p></o:p></pre>
<pre>mobile: (+370-684)-49802, (+370-614)-36366<o:p></o:p></pre>
<p class="MsoNormal"><br>
-- <br>
This message has been scanned for viruses and <br>
dangerous content by <a href="http://www.mailscanner.info/" target="_blank"><b>MailScanner</b></a>, and is
<br>
believed to be clean. <o:p></o:p></p>
</div>
</blockquote>
</div>
<div>
<p class="MsoNormal"><br clear="all">
<o:p></o:p></p>
</div>
<p class="MsoNormal"><br>
<span class="gmailsignatureprefix">-- </span><o:p></o:p></p>
<div>
<div>
<div>
<div>
<p class="MsoNormal">Antanas Vaitkus,<o:p></o:p></p>
</div>
<p class="MsoNormal">Vilnius University,<br>
Life Sciences Center,<br>
Institute of Biotechnology,<br>
room C521, Saulėtekio al. 7,<br>
LT-10257 Vilnius, Lithuania<o:p></o:p></p>
</div>
</div>
</div>
</div>
</body>
</html>