[Fix] Fix the KV Cache insertion logic for quantized OPT #1648

Merged
13 commits merged on Jul 19, 2023

Commits on Jul 3, 2023

  1. initial commit

    bogunowicz@arrival.com committed Jul 3, 2023
    5dc1977
  2. d3c6038

Commits on Jul 10, 2023

  1. swapped transpose and quantizelinear

    bogunowicz@arrival.com committed Jul 10, 2023
    87d03f9

Commits on Jul 11, 2023

  1. tiptoeing towards the fix

    dbogunowicz committed Jul 11, 2023
    d2fffbd

Commits on Jul 14, 2023

  1. cleanup, came up with a better idea for a fix

    bogunowicz@arrival.com committed Jul 14, 2023
    6fcd3f2
  2. revert a mistake

    bogunowicz@arrival.com committed Jul 14, 2023
    de8ebf7
  3. Delete hehe2.py

    dbogunowicz committed Jul 14, 2023
    d79fec2

Commits on Jul 17, 2023

  1. producing good looking graph lets test in deepsparse

    bogunowicz@arrival.com committed Jul 17, 2023
    53bfcd3
  2. clean implementation, working in opt

    bogunowicz@arrival.com committed Jul 17, 2023
    5e8c649
  3. Merge remote-tracking branch 'origin/main' into fix/damian/quantized_opt_cache

    bogunowicz@arrival.com committed Jul 17, 2023
    d2537a3

Commits on Jul 18, 2023

  1. simplify the PR

    bogunowicz@arrival.com committed Jul 18, 2023
    cf201e5
  2. ready for rereview

    bogunowicz@arrival.com committed Jul 18, 2023
    4a83566

Commits on Jul 19, 2023

  1. 25c5405