feat: Add search template mutation #9802

Dschoordsch · 2024-05-30T11:41:15Z

Description

Fixes #9774, #9775
Add a template search query to the User object.

Demo

https://www.loom.com/share/114efbf0a1864ba09816c0bf87a28620?sid=86867080-17c0-405a-bf6a-828a591bdfad

Testing scenarios

search for templates in the activity library
create custom templates, wait for the embedding (ca. 1min, you can check EmbeddingsJobQueue is empty) and search for it

Final checklist

I checked the code review guidelines
I have added Metrics Representative as reviewer(s) if my PR invovles metrics/data/analytics related changes
I have performed a self-review of my code, the same way I'd do it for any other team member
I have tested all cases I listed in the testing scenarios and I haven't found any issues or regressions
Whenever I took a non-obvious choice I added a comment explaining why I did it this way
I added the label Skip Maintainer Review Indicating the PR only requires reviewer review and can be merged right after it's approved if the PR introduces only minor changes, does not contain any architectural changes or does not introduce any new patterns and I think one review is sufficient'
PR title is human readable and could be used in changelog

The tests only work once the embeddings are calculated, which does not happen on CI.

Dschoordsch · 2024-06-25T10:47:18Z

packages/client/components/ActivityLibrary/ActivityLibrary.tsx

@@ -224,13 +258,29 @@ export const ActivityLibrary = (props: Props) => {
    onQueryChange,
    resetQuery
  } = useSearchFilter(templates, getTemplateDocumentValue)
-  const [debouncedSearchQuery] = useDebounce(searchQuery, 500)


@tianrunhe I changed the debounced query logic. Because this is used for analytics, let me know if that's ok.

nickoferrall · 2024-06-25T16:15:46Z

Hey @Dschoordsch, I'm getting a timeout error when using the semantic search: https://www.loom.com/share/11e27dce1ba747829b205b87f090a932

Is there any set-up required to test this one?

 rejection:
17:13:43 1|Socket Server     | Error: TIMEOUT
17:13:43 1|Socket Server     |     at Timeout._onTimeout (/Users/nickoferrall/parabol/dev/web.js:80818:16)
17:13:43 1|Socket Server     |     at listOnTimeout (node:internal/timers:573:17)
17:13:43 1|Socket Server     |     at process.processTimers (node:internal/timers:514:7)
17:14:25 2|GraphQL Executor  | TypeError: fetch failed
17:14:25 2|GraphQL Executor  |     at fetch (/Users/nickoferrall/parabol/node_modules/undici/index.js:112:15)
17:14:25 2|GraphQL Executor  |     at async Object.templateSearch (/Users/nickoferrall/parabol/dev/gqlExecutor.js:36867:22)
17:14:25 2|GraphQL Executor  | Trace: {
17:14:25 2|GraphQL Executor  |   error: '{"data":null,"errors":[{"message":"fetch failed","locations":[{"line":1,"column":575}],"path":["viewer","templateSearch"]}]}'
17:14:25 2|GraphQL Executor  | }

tianrunhe

LGTM metrics side

Dschoordsch · 2024-06-26T07:56:12Z

I'm getting a timeout error when using the semantic search

I tried to reproduce this, but had no success yet:

when no embedder is running and the embedding table is empty, I don't get a result but no error either
when I stop the text-embeddings-inference container, I get an error in fetch but a different one
I tried querying right after startup when the text-embeddings-inference container could be busy, but did not get any error

@nickoferrall Are you processing lots of data atm? Can you check your EmbeddingsJobQueue table how many entries you have?

nickoferrall · 2024-06-26T09:03:12Z

My EmbeddingsJobQueue had over 100 entries, and my Embeddings_ember_1 table had no entries. I removed all of the entries in the job queue, but got the same result.

Are these env vars the same as .env.example for you?

nickoferrall · 2024-06-26T11:50:42Z

@Dschoordsch after our call, I removed everything from Docker, restarted my laptop, cleared the job queue and metadata a couple of times, but no luck, unfortunately. The embeddings service is still slow to respond so I get a timeout.

We don't know whether it shouldn't be working for you, or it should be working for me, so perhaps someone else could test this PR so we can isolate the problem?

Could @tianrunhe test it as you're here please?

tianrunhe · 2024-06-26T16:07:48Z

My EmbeddingsJobQueue had over 100 entries, and my Embeddings_ember_1 table had no entries. I removed all of the entries in the job queue, but got the same result.

I encountered the same behavior as Nick. The stateMessage field for the rows are unable to get tokens:

Dschoordsch · 2024-06-26T18:57:42Z

Could one of you try deleting the dev folder

rm -r dev

and the embedding data

delete from "Embeddings_ember_1";
delete from "EmbeddingsMetadata";
delete from "EmbeddingsJobQueue";

and then try again?

Dschoordsch · 2024-06-27T13:37:49Z

@nickoferrall ☝️ could you try one more time? I noticed that locally I have some issues with the embedding service not always rebuilding.

nickoferrall · 2024-06-27T16:57:46Z

I tried the step above, but no luck.

I'm working on getting the text-embeddings-inference server install on bare metal here following this now

nickoferrall · 2024-06-28T14:48:18Z

I got the embedder up and running on my m1, but it’s still not working 😞

The job queue is empty as is the Embeddings_ember_1 table

Dschoordsch · 2024-07-01T08:25:12Z

I will go ahead and merge the change. If it fails, it will just yield no results, which is safe. As embeddings work in production, I'm positive the search will work there.

nickoferrall · 2024-07-01T10:07:20Z

packages/server/graphql/public/types/User.ts

+    // all team ids which could have accessible templates
+    const allTeamIds = ['aGhostTeam', ...allOrgTeams.map(({id}) => id)]
+
+    const response = await fetch('http://localhost:3040/embed', {


-1 this should be a env variable instead. I'm now using port 3041 for the text-embeddings-inference server because I'm running on my mac rather than Docker

Also, no localhost or port should be hardcoded into our code as each component can run in any port and location in other environments.

Dschoordsch marked this pull request as draft May 30, 2024 11:41

github-actions bot added the size/m label May 30, 2024

Base automatically changed from feat/9772/createMeetingTemplateEmbeddings to master June 13, 2024 09:27

Dschoordsch added 5 commits June 18, 2024 21:07

feat: Create embeddings for meeting templates

c8a4cc0

feat: Add search template mutation

b287229

Use a search tailored debounce and show a loading indicator

94bb3ba

Minor design tweaks

6ca4b43

Tweak the template search threshold

17235e3

Dschoordsch force-pushed the feat/9774/templateSearchMutation branch from 48c7007 to 17235e3 Compare June 24, 2024 15:35

Skip template search tests

3881d08

The tests only work once the embeddings are calculated, which does not happen on CI.

Dschoordsch requested a review from nickoferrall June 25, 2024 10:44

Dschoordsch marked this pull request as ready for review June 25, 2024 10:44

Dschoordsch commented Jun 25, 2024

View reviewed changes

Dschoordsch requested a review from tianrunhe June 25, 2024 10:47

tianrunhe approved these changes Jun 25, 2024

View reviewed changes

Dschoordsch merged commit 486f670 into master Jul 1, 2024
7 checks passed

Dschoordsch deleted the feat/9774/templateSearchMutation branch July 1, 2024 08:25

parabol-release-bot bot mentioned this pull request Jul 1, 2024

chore(release): release v7.38.0 #9905

Merged

nickoferrall reviewed Jul 1, 2024

View reviewed changes

rafaelromcar-parabol self-requested a review July 1, 2024 10:10

github-actions bot mentioned this pull request Jul 2, 2024

chore(release): Test v7.38.0 #9921

Merged

24 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Add search template mutation #9802

feat: Add search template mutation #9802

Dschoordsch commented May 30, 2024 •

edited

Loading

Dschoordsch Jun 25, 2024

nickoferrall commented Jun 25, 2024

tianrunhe left a comment

Dschoordsch commented Jun 26, 2024

nickoferrall commented Jun 26, 2024

nickoferrall commented Jun 26, 2024

tianrunhe commented Jun 26, 2024

Dschoordsch commented Jun 26, 2024 •

edited

Loading

Dschoordsch commented Jun 27, 2024

nickoferrall commented Jun 27, 2024

nickoferrall commented Jun 28, 2024

Dschoordsch commented Jul 1, 2024

nickoferrall Jul 1, 2024

rafaelromcar-parabol Jul 1, 2024

feat: Add search template mutation #9802

feat: Add search template mutation #9802

Conversation

Dschoordsch commented May 30, 2024 • edited Loading

Description

Demo

Testing scenarios

Final checklist

Dschoordsch Jun 25, 2024

Choose a reason for hiding this comment

nickoferrall commented Jun 25, 2024

tianrunhe left a comment

Choose a reason for hiding this comment

Dschoordsch commented Jun 26, 2024

nickoferrall commented Jun 26, 2024

nickoferrall commented Jun 26, 2024

tianrunhe commented Jun 26, 2024

Dschoordsch commented Jun 26, 2024 • edited Loading

Dschoordsch commented Jun 27, 2024

nickoferrall commented Jun 27, 2024

nickoferrall commented Jun 28, 2024

Dschoordsch commented Jul 1, 2024

nickoferrall Jul 1, 2024

Choose a reason for hiding this comment

rafaelromcar-parabol Jul 1, 2024

Choose a reason for hiding this comment

Dschoordsch commented May 30, 2024 •

edited

Loading

Dschoordsch commented Jun 26, 2024 •

edited

Loading