Unitxt 1.12.0

@yoavkatz released this 31 Jul 12:25

Main changes

Non backward compatible changes

  • Changed template method names "input_fields" and "reference_fields" (affects only people who wrote custom template code) by @yoavkatz in #1030
  • Refactor Rouge and Meteor to InstanceMetric for faster score computation - this causes very small variations in scores (well within the confidence interval) by @yoavkatz in #1011
  • Ability to create demo samplers based on instance (this causes changes in random selection of demos in normal mode) by @yoavkatz in #1034
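
As an illustration of what "well within the confidence interval" means for the Rouge and Meteor change, here is a minimal sketch in plain Python. It does not use the Unitxt API: `bootstrap_ci`, the sample scores, and the resampling parameters are all hypothetical, chosen only to show how per-instance scores are averaged and how a bootstrap interval around that mean can be compared with a slightly different score.

```python
import random
import statistics


def bootstrap_ci(scores, n_resamples=1000, alpha=0.05, seed=0):
    """Percentile bootstrap interval for the mean of per-instance scores.

    Hypothetical helper for illustration; not part of Unitxt.
    """
    rng = random.Random(seed)  # fixed seed so repeated runs agree
    n = len(scores)
    # Resample the instances with replacement and record each resample's mean.
    means = sorted(
        statistics.fmean(rng.choices(scores, k=n)) for _ in range(n_resamples)
    )
    low = means[int((alpha / 2) * n_resamples)]
    high = means[int((1 - alpha / 2) * n_resamples) - 1]
    return low, high


# Made-up per-instance scores, standing in for instance-level metric output.
instance_scores = [0.42, 0.55, 0.61, 0.38, 0.47, 0.59, 0.50, 0.44]
global_score = statistics.fmean(instance_scores)
low, high = bootstrap_ci(instance_scores)

print(f"mean={global_score:.3f}, 95% CI=({low:.3f}, {high:.3f})")
```

A score from an earlier release that falls between `low` and `high` would not be distinguishable from the new one at this confidence level.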

Changes in Catalog

  • Safety and regard metrics became instance metrics and are now named SafetyMetric and RegardMetric by @dafnapension in #1004
  • Remove financebench card since it was removed from HF by @elronbandel in #1016
  • add validation to tldr, remove shuffle from billsum by @alonh in #1038
  • Fix typo in japanese_llama system prompt (issue #964) by @bnayahu in #1056
  • numeric nlg dataset template changes by @ShirApp in #1041

Additions to catalog

New Features

Bug Fixes

  • Solve problem with stripping format in LLM as a judge code by @eladven in #1005
  • Added seed to LLM as judges for consistent results by @yoavkatz in #1029
  • Fixed issues with fresh install by @yoavkatz in #1037
  • WML Inference Engine fix by @pawelknes in #1013
  • replace type and type in type error message by @perlitz in #1035
  • FinQA - filter problematic examples by @ShirApp in #1039
  • demo's target prefix is now taken from demo instance by @dafnapension in #1031
  • Make sure preparation times printed fully and nicely by @elronbandel in #1046
  • Added prediction type to LLM as judge to avoid warning by @yoavkatz in #1072
  • Fixed confidence interval inconsistency when some metrics compute ci and some do not by @dafnapension in #1065
  • Fix bug in data classes and add support for field overriding in fields containing types or functions by @elronbandel in #1027
  • Set LoadFromIBMCloud verify to be lazy, in order to allow preparing the cards without defining FMEVAL_COS_URL by @eladven in #1021
  • Added check of type of format and system prompt to LLM as judge by @yoavkatz in #1068
  • Allow assigning None in overwrites when fetching artifacts with modifications by @dafnapension in #1062
  • fix - building test is not working. Updated Kaggle version. by @benjaminsznajder in #1055
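
The lazy verification fix listed above follows a common pattern: defer the environment check from construction time to first use, so objects can be built on machines where the variable is not set. The sketch below shows that pattern in plain Python. `LazyCloudLoader`, its methods, and the `EXAMPLE_COS_URL` variable are hypothetical names for illustration; this is not the actual LoadFromIBMCloud implementation.

```python
import os


class LazyCloudLoader:
    """Hypothetical loader that checks its environment only when used."""

    def __init__(self, env_prefix):
        # Only store the prefix; no environment lookup happens here,
        # so construction succeeds even when the variable is undefined.
        self.env_prefix = env_prefix

    def verify(self):
        # The check runs on demand rather than at construction time.
        name = f"{self.env_prefix}_URL"
        if name not in os.environ:
            raise RuntimeError(f"environment variable {name} is not defined")
        return os.environ[name]

    def load(self):
        url = self.verify()
        # Placeholder for the real download step.
        return f"would load from {url}"


# Construction works without the variable; only load() requires it.
loader = LazyCloudLoader("EXAMPLE_COS")
```

With this shape, code that only prepares or inspects the loader never triggers the check, while an actual `load()` call still fails with a clear message if the variable is missing.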

Documentation changes


New Contributors

We want to thank the new contributors for their first contributions!