Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mangled accents, bibtex proceedings oddity #10

Open
lintool opened this issue Aug 14, 2018 · 3 comments
Open

Mangled accents, bibtex proceedings oddity #10

lintool opened this issue Aug 14, 2018 · 3 comments
Assignees

Comments

@lintool
Copy link
Member

lintool commented Aug 14, 2018

Crawler seems to be mangling accents:

@inproceedings{Begoli_Camacho-Rodriguez_Hyde_Mior_Lemire_2018a, title={Apache Calcite: A Foundational Framework for Optimized Query Processing Over Heterogeneous Data Sources.}, DOI={http://doi.acm.org/10.1145/3183713.3190662}, booktitle={SIGMOD Conference}, author={Begoli, Edmon and Camacho-Rodríguez, Jesús and Hyde, Julian and Mior, Michael and Lemire, Daniel}, year={2018}, pages={221–230}}

Also:
https://dblp.uni-trier.de/rec/bibtex/conf/sigmod/BegoliCHML18

booktitle says:

  booktitle = {Proceedings of the 2018 International Conference on Management of
               Data, {SIGMOD} Conference 2018, Houston, TX, USA, June 10-15, 2018},

Why is it "SIGMOD Conference" above?

Also - why can't we just crawl the bibtex here?
https://dblp.uni-trier.de/rec/bibtex/conf/sigmod/BegoliCHML18

@michaelmior
Copy link
Member

I could separately crawl the BibTeX I suppose. It was just much easier to use the data I had already pulled.

@michaelmior
Copy link
Member

Crawling the BibTeX and deduplicating is probably a better solution, but I pushed this to fix the accents. I'll try to take a look at the other oddity later.

@michaelmior
Copy link
Member

For the proceedings oddity, it looks like "SIGMOD Conference" is the only useful thing DBLP returns from its XML API. Unfortunately, this means significant rewriting to fix which is out of scope for me right now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants