    Repositories list

    • ESC-Eval

      Public
      [EMNLP 2024] “ESC-Eval: Evaluating Emotion Support Conversations in Large Language Models”
      Python
      0610 Updated Oct 24, 2024
    • probing AI intelligence with reflection
      Python
      0500 Updated Oct 24, 2024
    • MLLMGuard

      Public
      Python
      21720 Updated Oct 22, 2024
    • Flames

      Public
      Flames is a highly adversarial Chinese-language benchmark for evaluating the harmlessness of LLMs, developed by Shanghai AI Lab and Fudan NLP Group.
      Apache License 2.0
      03410 Updated May 21, 2024
    • Python
      Apache License 2.0
      0500 Updated Mar 22, 2024