Note added to annoytutorial.ipynb #1137

greninja · 2017-02-05T21:26:16Z

Note explaining why gensim's 'most_similar' method uses multicore whereas annoy's 'most_similar' runs on a single core.

piskvorky · 2017-02-06T00:45:36Z

The description seems incorrect; the parallelism is nothing to do with GIL or Python, it's on the level of BLAS.

Also, typos (space after full stop, space before brackets, GLobal).

greninja · 2017-02-06T04:21:08Z

Okay. Another possible explanation is :

If numpy on your machine is using one of the BLAS libraries like ATLAS or LAPACK, it ll run on multiple cores if the machine has multicore support. And clearly gensim's most_similar method is using numpy's dot operation.

Does this description sound right? I ll make changes accordingly. Also will correct the typos.

tmylk · 2017-02-06T14:25:21Z

That's correct. Please change the PR

tmylk · 2017-02-06T19:52:09Z

Hi, unfortunately using Gensim doesn't guarantee multiple cores. Will I be possible to make it clear?

greninja · 2017-02-06T20:11:52Z

Should I just remove the initial note written in bold?

piskvorky · 2017-03-03T22:06:41Z

docs/notebooks/annoytutorial.ipynb

@@ -179,7 +179,7 @@
    "\n",
    ">**Note**: Initialization time for the annoy indexer was not included in the times. The optimal knn algorithm for you to use will depend on how many queries you need to make and the size of the corpus. If you are making very few similarity queries, the time taken to initialize the annoy indexer will be longer than the time it would take the brute force method to retrieve results. If you are making many queries however, the time it takes to initialize the annoy indexer will be made up for by the incredibly fast retrieval times for queries once the indexer has been initialized\n",
    "\n",
-    ">**Note** : **If you are using gensim, it'll run on multiple cores**. Gensim's 'most_similar' method is using numpy operations in the form of dot product whereas Annoy's method isnt. If 'numpy' on your machine is using one of the BLAS libraries like ATLAS or LAPACK, it'll run on multiple cores(only if your machine has multicore support ). "
+    ">**Note** : Gensim's 'most_similar' method is using numpy operations in the form of dot product whereas Annoy's method isnt. If 'numpy' on your machine is using one of the BLAS libraries like ATLAS or LAPACK, it'll run on multiple cores(only if your machine has multicore support ). "


isnt => isn't

LAPACK is not BLAS.

cores(only => cores (only

support ). => support).

@tmylk , did you review before merging?

Agree that there is a comma missing before "or LAPACK", CC @greninja

What comma? LAPACK is not a BLAS library, neither software uses LAPACK.

Maybe you meant OpenBlas?

greninja added 2 commits February 6, 2017 01:59

Adding a note to annoytutorial.ipynb

2901206

Changed few words in annoytutorial.ipynb

dbde461

Changed the description

e6943fc

greninja added 2 commits February 9, 2017 01:36

Removed the misleading initial note

af10e20

Adding link to the scipy docs for details

974341a

tmylk merged commit 3e3e6dc into piskvorky:develop Feb 16, 2017

piskvorky reviewed Mar 3, 2017

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Note added to annoytutorial.ipynb #1137

Note added to annoytutorial.ipynb #1137

greninja commented Feb 5, 2017

piskvorky commented Feb 6, 2017

greninja commented Feb 6, 2017

tmylk commented Feb 6, 2017 •

edited

Loading

tmylk commented Feb 6, 2017

greninja commented Feb 6, 2017

piskvorky Mar 3, 2017 •

edited

Loading

tmylk Mar 3, 2017

piskvorky Mar 3, 2017 •

edited

Loading

Note added to annoytutorial.ipynb #1137

Note added to annoytutorial.ipynb #1137

Conversation

greninja commented Feb 5, 2017

piskvorky commented Feb 6, 2017

greninja commented Feb 6, 2017

tmylk commented Feb 6, 2017 • edited Loading

tmylk commented Feb 6, 2017

greninja commented Feb 6, 2017

piskvorky Mar 3, 2017 • edited Loading

Choose a reason for hiding this comment

tmylk Mar 3, 2017

Choose a reason for hiding this comment

piskvorky Mar 3, 2017 • edited Loading

Choose a reason for hiding this comment

tmylk commented Feb 6, 2017 •

edited

Loading

piskvorky Mar 3, 2017 •

edited

Loading

piskvorky Mar 3, 2017 •

edited

Loading