Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Outdated or buggy journal abbreviations data #2

Open
ravwojdyla opened this issue Aug 17, 2021 · 4 comments
Open

Outdated or buggy journal abbreviations data #2

ravwojdyla opened this issue Aug 17, 2021 · 4 comments

Comments

@ravwojdyla
Copy link

Not sure if the data is outdated or if there is a bug but some journals have outdated/invalid(?) iso abbreviations. Example from pubmed/J_Medline.txt:

JournalTitle: The New England journal of medicine
MedAbbr: N Engl J Med
ISSN (Print): 0028-4793
ISSN (Online): 1533-4406
IsoAbbr: N Engl J Med
NlmId: 0255562

Notice the Iso and Med abbreviations (are the same), but in dhimmel/delays, they are different: N. Engl. J. Med. (Iso) vs N Engl J Med (Med) (notice the dots).

@ravwojdyla
Copy link
Author

That said, for that journal wikipedia says the ISO is N. Engl. J. Med. (with dots). Would that mean that NLM's pubmed/J_Medline.txt is invalid 🤷? NLM entry is here.

@dhimmel
Copy link
Owner

dhimmel commented Aug 17, 2021

The abbreviation used by PubMed is the "NLM Title Abbreviation", which I believe is the same as MedAbbr. So in PubMed, the journal is displayed as "N Engl J Med" as seen in this search result:

image

Looking at the online NLM journal record at https://www.ncbi.nlm.nih.gov/nlmcatalog/255562, it doesn't appear to list a field for the ISO abbreviation.

So based on your comment, it seems that the NLM catalog via J_Medline.txt used to have the proper abbreviations in IsoAbbr but currently does not (because it is missing the periods)?

@ravwojdyla
Copy link
Author

So based on your comment, it seems that the NLM catalog via J_Medline.txt used to have the proper abbreviations in IsoAbbr but currently does not (because it is missing the periods)?

@dhimmel not saying NLM used to have "proper abbreviations" (I don't know that), not sure which records have changed, just observing in this issue that in this repo's data the ISO abbreviations do have dots, but currently available records in NLM don't. Whether NLM's records are valid, is a separate question. I haven't researched which ISO abbreviation is "correct" :)

dhimmel added a commit that referenced this issue Aug 20, 2021
@dhimmel
Copy link
Owner

dhimmel commented Aug 20, 2021

I updated the NLM catalog export in 83577d4. I looked and IsoAbbr is now always the same as MedAbbr. So seems like these two columns used to refer to different abbreviations, but have been rectified to be the same. Now I am not sure whether the version with or without the periods is the one that follows the actual ISO standard.

I also updated the downstream scopus metrics in dhimmel/scopus@1c2f8aa.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants