How should one parse unit names that are not valid identifiers? #554

jameshiebert · 2017-09-21T18:36:36Z

I'm using pint to do checking and conversion of units for some database insertion code and I'm a bit confused about how to define the unit '%'.

Basically, I have a set of weather observations coming in from multi-source input files that are attributed with units, and I'm stuffing them into a database table that may be attributed with different (but compatible) units.

In particular, we're pulling in relative humidity data that are attributed with the units "%" in the database.

Based on the comments in #226, I assumed that the new parser would allow units like " ' and %. But then in pint's test suite the test for this case seems to be intended to fail?

    @unittest.expectedFailure
    def test_issue25(self):
        x = ParserHelper.from_string('10 %')

Or am I reading that wrong?

Defining the unit '%' doesn't error out, but then trying to parse it does:

>>> import pint
>>> ureg = pint.UnitRegistry()
>>> ureg.define('% = percent')
>>> ureg.parse_expression('%')
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/home/hiebert/code/crmprtd/env36/lib/python3.6/site-packages/pint/registry.py", line 840, in parse_expression
    return build_eval_tree(gen).evaluate(lambda x: self._eval_token(x,
AttributeError: 'NoneType' object has no attribute 'evaluate'

Is there some other way that I should be doing this? Thanks for any info that you can provide.

The text was updated successfully, but these errors were encountered:

hgrecco · 2017-09-22T12:54:44Z

The new parser should definitely allow this as we have decoupled from Python's eval. But we are still using the Python tokenizer. Notice this:
The interesting thing is that:

>>> ureg.parse_expression('10%')
10

I think that the quickest way in your case will be to introduce a replacement rule '%' -> '* 0.01'.

dalito · 2017-09-27T21:38:46Z

Also interesting:

>>> ureg.parse_expression(u'2%4')
8                               
>>> 2%4                         
2

jameshiebert · 2017-09-27T22:04:13Z

Thanks @hgrecco , I'll give that a shot.

>>> 2%4 
2

Well... that's 2 [mod] 4, so that doesn't surprise me.

hgrecco · 2019-12-21T04:56:28Z

You can now add your own preprocessors See #911

jameshiebert · 2019-12-22T07:06:16Z

Perfect, thanks!

hgrecco closed this as completed Dec 21, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

How should one parse unit names that are not valid identifiers? #554

How should one parse unit names that are not valid identifiers? #554

jameshiebert commented Sep 21, 2017

hgrecco commented Sep 22, 2017

dalito commented Sep 27, 2017

jameshiebert commented Sep 27, 2017

hgrecco commented Dec 21, 2019

jameshiebert commented Dec 22, 2019

How should one parse unit names that are not valid identifiers? #554

How should one parse unit names that are not valid identifiers? #554

Comments

jameshiebert commented Sep 21, 2017

hgrecco commented Sep 22, 2017

dalito commented Sep 27, 2017

jameshiebert commented Sep 27, 2017

hgrecco commented Dec 21, 2019

jameshiebert commented Dec 22, 2019