load_word_topics() returns value #767

bhargavvader · 2016-06-30T13:55:13Z

@tmylk, @piskvorky , with reference to #764 .

load_word_topics() now returns word topics which is assigned by train to self.wordtopics.

Before, there were two variables; word_topics and a wordtopics, and only the wordtopics variable was being used elsewhere; so I removed word_topics.

Is that ok?

tmylk · 2016-06-30T16:09:43Z

Rename wordtopics to word_topics.

Also should there be a meaningful error like 'run train or load_word_topics before showing topics' in https://github.com/bhargavvader/gensim/blob/6fd1ecbfe41d7ab2adc17d20179d467e986f477a/gensim/models/wrappers/ldamallet.py#L244

bhargavvader · 2016-06-30T16:13:29Z

Will do. Also, this PR doesn't seem to have a travis build happening, any reason?

devashishd12 · 2016-06-30T16:55:07Z

oh renaming to word_topics would break some other things actually....

tmylk · 2016-06-30T23:06:04Z

@bhargavvader Well spotted about Travis - other PRs are working ok. Let's see what happens on your next commit.

bhargavvader · 2016-07-01T08:14:19Z

@dsquareindia , exactly what else would break? Not if we change all the wordtopics to word_topics right? The tests written still pass fine.

devashishd12 · 2016-07-01T08:43:03Z

@bhargavvader the mallet2ldamodel would fail and also the support for ldamallet in topic coherence would fail. Tests pass because tests for one of them hasn't been added and for the other one hasn't been merged. I'll change the spelling in my PR but could you please change it in coherencemodel.py? Also a quick git grep would help.

bhargavvader · 2016-07-01T08:52:12Z

@dsquareindia , made the changes in test_ldamallet_wrapper.py, which is the only other place wordtopics shows up in.

coherencemodel.py doesn't seem to really use LdaMallet right now - but I do see it in PR #750 which you're working on, so it would make more sense for you to change it yourself.

…aMallet

devashishd12 · 2016-07-01T09:17:16Z

yeah @bhargavvader I'll make both the changes in my pr once this gets merged. Thanks!

bhargavvader · 2016-07-01T09:17:35Z

@tmylk , @piskvorky , can you have a look?
Also, about the warning, I've just used logger.info now. Is that ok?

tmylk · 2016-07-01T10:35:35Z

gensim/models/wrappers/ldamallet.py

@@ -242,7 +242,9 @@ def show_topics(self, num_topics=10, num_words=10, log=False, formatted=True):
        return shown

    def show_topic(self, topicid, topn=10):
-        topic = self.wordtopics[topicid]
+        if self.word_topics is None:
+            logger.info("Run train or load_word_topics before showing topics.")


warn instead of info

tmylk · 2016-07-01T16:22:05Z

Thanks for the PR. It's a useful fix!

piskvorky · 2016-07-02T07:45:50Z

-1 on renaming wordtopics to word_topics. This breaks backward compatibility for any code that uses that (including a lot of our own code). At least make it an alias.

piskvorky · 2016-07-02T07:46:52Z

gensim/models/wrappers/ldamallet.py

-        logger.info("loaded assigned topics for %i tokens", wordtopics.sum())
-        self.wordtopics = wordtopics
+                word_topics[int(topic), tokenid] += 1.0
+        logger.info("loaded assigned topics for %i tokens", word_topics.sum())
        self.print_topics(15)


This print makes no sense without the self.wordtopics assignment.

@tmylk did you review this before merging??

You're right - the print was there in the code earlier and I removed the self.wordtopics assignment so now it doesn't make sense.

bhargavvader · 2016-07-02T08:25:30Z

@piskvorky , how exactly would one make an alias?
And what exactly do you mean by breaking the older code? I had checked for occurrences of wordtopics in the repo and couldn't find anything I didn't change. Or do you mean tutorials and the like?

I'll make another PR for that and for the unnecessary print statement. Is there anything else you think should change?

piskvorky · 2016-07-02T08:33:33Z

I mean, we use it in our own code, in commercial projects :)

I think it's safe to assume some other people depend on it too, in the same way. Unless it's unavoidable, we try to be backward-compatible with our changes.

Alias = two variables referring to the same object, i.e. x = y = some_var, or x = some_var; y = some_var.

bhargavvader · 2016-07-02T08:53:08Z

Ok, makes sense.

As for making an alias in this case, would you mean adding a line such as self.wordtopics = self.word_topics = word_topics or such whenever an assignment is being made?

piskvorky · 2016-07-02T12:17:31Z

Yes, assign self.wordtopics = self.word_topics, with a big fat comment explaining why this alias is there.

Option two: just continue to call the var self.wordtopics, like it was before.

bhargavvader added 2 commits June 30, 2016 19:14

Returns wordtopics

4dc6ff7

Changed to wordtopics

6fd1ecb

bhargavvader added 3 commits July 1, 2016 14:23

Merge branch 'develop' of https://github.com/piskvorky/gensim into Ld…

b7b09f2

…aMallet

Spelling changes

0545d16

Added log message

65fbb26

tmylk reviewed Jul 1, 2016
View reviewed changes

Changed to warn

ada5b59

tmylk merged commit 003a886 into piskvorky:develop Jul 1, 2016

piskvorky reviewed Jul 2, 2016
View reviewed changes

bhargavvader deleted the LdaMallet branch July 2, 2016 08:23

This was referenced Jul 2, 2016

DTM wrapper bug fixes. #770

Merged

LdaMallet Fixes #771

Merged

tmylk pushed a commit that referenced this pull request Jul 5, 2016

LdaMallet word_topics fixes after #767 (#771)

868716b

devashishd12 mentioned this pull request Jul 19, 2016

LdaMallet wrapper giving AttributeError #796

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

load_word_topics() returns value #767

load_word_topics() returns value #767

bhargavvader commented Jun 30, 2016 •

edited

Loading

tmylk commented Jun 30, 2016 •

edited

Loading

bhargavvader commented Jun 30, 2016

devashishd12 commented Jun 30, 2016

tmylk commented Jun 30, 2016

bhargavvader commented Jul 1, 2016 •

edited

Loading

devashishd12 commented Jul 1, 2016

bhargavvader commented Jul 1, 2016 •

edited

Loading

devashishd12 commented Jul 1, 2016

bhargavvader commented Jul 1, 2016

tmylk Jul 1, 2016

bhargavvader Jul 1, 2016

tmylk commented Jul 1, 2016

piskvorky commented Jul 2, 2016 •

edited

Loading

piskvorky Jul 2, 2016 •

edited

Loading

bhargavvader Jul 2, 2016

bhargavvader commented Jul 2, 2016 •

edited

Loading

piskvorky commented Jul 2, 2016 •

edited

Loading

bhargavvader commented Jul 2, 2016

piskvorky commented Jul 2, 2016 •

edited

Loading

load_word_topics() returns value #767

load_word_topics() returns value #767

Conversation

bhargavvader commented Jun 30, 2016 • edited Loading

tmylk commented Jun 30, 2016 • edited Loading

bhargavvader commented Jun 30, 2016

devashishd12 commented Jun 30, 2016

tmylk commented Jun 30, 2016

bhargavvader commented Jul 1, 2016 • edited Loading

devashishd12 commented Jul 1, 2016

bhargavvader commented Jul 1, 2016 • edited Loading

devashishd12 commented Jul 1, 2016

bhargavvader commented Jul 1, 2016

tmylk Jul 1, 2016

Choose a reason for hiding this comment

bhargavvader Jul 1, 2016

Choose a reason for hiding this comment

tmylk commented Jul 1, 2016

piskvorky commented Jul 2, 2016 • edited Loading

piskvorky Jul 2, 2016 • edited Loading

Choose a reason for hiding this comment

bhargavvader Jul 2, 2016

Choose a reason for hiding this comment

bhargavvader commented Jul 2, 2016 • edited Loading

piskvorky commented Jul 2, 2016 • edited Loading

bhargavvader commented Jul 2, 2016

piskvorky commented Jul 2, 2016 • edited Loading

bhargavvader commented Jun 30, 2016 •

edited

Loading

tmylk commented Jun 30, 2016 •

edited

Loading

bhargavvader commented Jul 1, 2016 •

edited

Loading

bhargavvader commented Jul 1, 2016 •

edited

Loading

piskvorky commented Jul 2, 2016 •

edited

Loading

piskvorky Jul 2, 2016 •

edited

Loading

bhargavvader commented Jul 2, 2016 •

edited

Loading

piskvorky commented Jul 2, 2016 •

edited

Loading

piskvorky commented Jul 2, 2016 •

edited

Loading