From grazulis at ibt.lt Mon Aug 11 09:24:13 2025 From: grazulis at ibt.lt (=?UTF-8?Q?Saulius_Gra=C5=BEulis?=) Date: Mon, 11 Aug 2025 09:24:13 +0300 Subject: [Cod-bugs] Missing Journal Information In-Reply-To: References: <70459c5f-188d-4400-a430-1c9e17c442d9@ibt.lt> <029fd080-1e94-4732-a2a6-2cec389c2104@ibt.lt> <567187b5-901f-4e14-8696-b81545b7646e@ibt.lt> <18ee432d-9996-417d-a35c-4d39ad538518@ibt.lt> <426e4000-56e2-4fde-8751-36e1ea11c209@ibt.lt> <16b90bac-3d4f-4fd3-8ed1-a891bb58256d@ibt.lt> Message-ID: On 2025-08-10 18:52, Robert McMeeking - STFC UKRI wrote: > > Hi Saulius > > Here are a list of ?recovered? article titles. It need a bit of work. > For instance there appear to be problems rendering Unicode character > after transfer from the linux server to my laptop. > Thank you very much! > > Will get back with further details later when I get to check things > out on Monday > OK, let me know how it goes. If possible, please include also DOI column for each COD ID (for cross-checks), and add the column headers. Regards, Saulius > Regards > > Bob > > *From:*Saulius Gra?ulis > *Sent:* 07 August 2025 14:42 > *To:* McMeeking, Robert (STFC,DL,SC) ; > 'Antanas Vaitkus' > *Cc:* cod-bugs at ibt.lt > *Subject:* Re: [Cod-bugs] Missing Journal Information > > On 2025-08-07 11:33, Robert McMeeking - STFC UKRI wrote: > > I have done some tests on some of the entries with article title > issues. The ones I checked *do *appear to have titles. I will try > to get the corrections to you quite soon. > > We can find COD entries without titles in the COD SQL database. I can > send you a list of such entries or an SQL query if you would like. > > I have notices problems with a number of the dois. My scripts > appear to have problems containing any if characters: <>() > > Lol ? I had exactly the same problem :). > > I think we need to us 'urlencode' for them before sending them to the > doi.org as a DOI request, it worked for me. I can send yo may shell > hack for this if that would help. > > I can see how these characters might give problems in a Unix > environment. But I assume I should be able to fix the problems. > Having said that I am a bit surprised that these characters are > allowed in valid dois! > > Yes, DOIs are more permissive than even URLs. No idea why they did > that, but that's what we have. Let me know if you would like to look > at any of my hacks on this. > > Regards, > Saulius > > PS. The 40k+ bibliographies for COD entries with missing years or page > numbers were downloaded. I'll have to look at them and sort them out > (some? have failed since the DOIs are not from journals but from > university repos, some journals no longer hand our page numbers or do > not include them into DOI-derived bibliography files...). We'll see > how it works but a lot of COD entries can be fixed now. > > S.G. > > -- > Dr. Saulius Gra?ulis > Vilnius University Institute of Biotechnology, Saul?tekio al. 7 > LT-10257 Vilnius, Lietuva (Lithuania) > mobile: (+370-684)-49802, (+370-614)-36366 > > > -- > This message has been scanned for viruses and > dangerous content by *MailScanner* , and is > believed to be clean. > > > -- > This message has been scanned for viruses and > dangerous content by *MailScanner* , and is > believed to be clean. -- Dr. Saulius Gra?ulis Vilnius University Institute of Biotechnology, Saul?tekio al. 7 LT-10257 Vilnius, Lietuva (Lithuania) mobile: (+370-684)-49802, (+370-614)-36366 -- This message has been scanned for viruses and dangerous content by MailScanner, and is believed to be clean. -------------- next part -------------- An HTML attachment was scrubbed... URL: