[Cod-bugs] Missing Journal Information
Saulius Gražulis
grazulis at ibt.lt
Mon Aug 11 09:24:13 EEST 2025
On 2025-08-10 18:52, Robert McMeeking - STFC UKRI wrote:
>
> Hi Saulius
>
> Here are a list of “recovered” article titles. It need a bit of work.
> For instance there appear to be problems rendering Unicode character
> after transfer from the linux server to my laptop.
>
Thank you very much!
>
> Will get back with further details later when I get to check things
> out on Monday
>
OK, let me know how it goes. If possible, please include also DOI column
for each COD ID (for cross-checks), and add the column headers.
Regards,
Saulius
> Regards
>
> Bob
>
> *From:*Saulius Gražulis <grazulis at ibt.lt>
> *Sent:* 07 August 2025 14:42
> *To:* McMeeking, Robert (STFC,DL,SC) <robert.mcmeeking at stfc.ac.uk>;
> 'Antanas Vaitkus' <antanas.vaitkus90 at gmail.com>
> *Cc:* cod-bugs at ibt.lt
> *Subject:* Re: [Cod-bugs] Missing Journal Information
>
> On 2025-08-07 11:33, Robert McMeeking - STFC UKRI wrote:
>
> I have done some tests on some of the entries with article title
> issues. The ones I checked *do *appear to have titles. I will try
> to get the corrections to you quite soon.
>
> We can find COD entries without titles in the COD SQL database. I can
> send you a list of such entries or an SQL query if you would like.
>
> I have notices problems with a number of the dois. My scripts
> appear to have problems containing any if characters: <>()
>
> Lol – I had exactly the same problem :).
>
> I think we need to us 'urlencode' for them before sending them to the
> doi.org as a DOI request, it worked for me. I can send yo may shell
> hack for this if that would help.
>
> I can see how these characters might give problems in a Unix
> environment. But I assume I should be able to fix the problems.
> Having said that I am a bit surprised that these characters are
> allowed in valid dois!
>
> Yes, DOIs are more permissive than even URLs. No idea why they did
> that, but that's what we have. Let me know if you would like to look
> at any of my hacks on this.
>
> Regards,
> Saulius
>
> PS. The 40k+ bibliographies for COD entries with missing years or page
> numbers were downloaded. I'll have to look at them and sort them out
> (some have failed since the DOIs are not from journals but from
> university repos, some journals no longer hand our page numbers or do
> not include them into DOI-derived bibliography files...). We'll see
> how it works but a lot of COD entries can be fixed now.
>
> S.G.
>
> --
> Dr. Saulius Gražulis
> Vilnius University Institute of Biotechnology, Saulėtekio al. 7
> LT-10257 Vilnius, Lietuva (Lithuania)
> mobile: (+370-684)-49802, (+370-614)-36366
>
>
> --
> This message has been scanned for viruses and
> dangerous content by *MailScanner* <http://www.mailscanner.info/>, and is
> believed to be clean.
>
>
> --
> This message has been scanned for viruses and
> dangerous content by *MailScanner* <http://www.mailscanner.info/>, and is
> believed to be clean.
--
Dr. Saulius Gražulis
Vilnius University Institute of Biotechnology, Saulėtekio al. 7
LT-10257 Vilnius, Lietuva (Lithuania)
mobile: (+370-684)-49802, (+370-614)-36366
--
This message has been scanned for viruses and
dangerous content by MailScanner, and is
believed to be clean.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.crystallography.net/pipermail/cod-bugs/attachments/20250811/faf80de0/attachment.htm>
More information about the Cod-bugs
mailing list