    Repositories list

    • SLAM-LLM

      Public
      Speech, Language, Audio, Music Processing with Large Language Model
      Python
      MIT License
      55612120 Updated Dec 12, 2024
    • VQTalker

      Public
      [AAAI 2025] VQTalker: Towards Multilingual Talking Avatars through Facial Motion Tokenization
      Apache License 2.0
      01300 Updated Dec 12, 2024
    • 0000 Updated Dec 10, 2024
    • kaiyu

      Public
      Kaiyu, Shanghai Jiao Tong University
      HTML
      0000 Updated Dec 8, 2024
    • [AAAI 2024] CTX-txt2vec, the acoustic model in UniCATS
      Python
      86350 Updated Nov 18, 2024
    • A Universal Platform for Training and Evaluation of Mobile Interaction
      Python
      Apache License 2.0
      43900 Updated Nov 14, 2024
    • Xmart

      Public
      Xmart Student Forum
      Apache License 2.0
      0200 Updated Nov 13, 2024
    • Codes for paper "Converging to a Lingua Franca: Evolution of Linguistic Regions and Semantics Alignment in Multilingual Large Language Models"
      Python
      0200 Updated Oct 29, 2024
    • MBS

      Public
      [COLING 2024] Multilingual Brain Surgeon: Large Language Models Can be Compressed Leaving No Language Behind
      Python
      1300 Updated Oct 12, 2024
    • x-lance.github.io

      Public template
      【🛠️🛠️🛠️This Page is Under Construction!!!】Welcome to X-LANCE! Cross Media Language Intelligence Lab in Shanghai Jiao Tong University.
      HTML
      MIT License
      11k000 Updated Oct 7, 2024
    • [ICASSP 2024] This is the official code for "VoiceFlow: Efficient Text-to-Speech with Rectified Flow Matching"
      Python
      2132070 Updated Sep 3, 2024
    • AniTalker

      Public
      [ACM MM 2024] This is the official code for "AniTalker: Animate Vivid and Diverse Talking Faces through Identity-Decoupled Facial Motion Encoding"
      Jupyter Notebook
      Apache License 2.0
      1371.5k80 Updated Aug 15, 2024
    • [EMNLP 2022] Leaderboard of META-GUI
      CSS
      0000 Updated Jul 9, 2024
    • [EMNLP 2022] The baseline code for META-GUI dataset
      Python
      MIT License
      31230 Updated Jul 9, 2024
    • Python
      Apache License 2.0
      0400 Updated Jun 21, 2024
    • [AAAI 2024] Code for CTX-vec2wav in UniCATS
      Python
      1612360 Updated Jun 11, 2024
    • [NAACL 2024] CoE-SQL: In-Context Learning for Multi-Turn Text-to-SQL with Chain-of-Editions
      Python
      0810 Updated May 7, 2024
    • StoryTTS

      Public
      [ICASSP 2024] StoryTTS: A Highly Expressive Text-to-Speech Dataset with Rich Textual Expressiveness Annotations
      HTML
      Other
      413720 Updated Apr 27, 2024
    • weblm

      Public
      [WSDM 2024] Hierarchical Multimodal Pre-training for Visually Rich Webpage Understanding
      01320 Updated Mar 6, 2024
    • WebSRC

      Public
      [EMNLP 2021] WebSRC: A dataset for web based structural machine reading comprehension.
      CSS
      0400 Updated Feb 13, 2024
    • MSDWILD

      Public
      [INTERSPEECH 2022] This dataset is designed for multi-modal speaker diarization and lip-speech synchronization in the wild.
      HTML
      Other
      14310 Updated Jan 24, 2024
    • [EMNLP 2023 Findings] ACT-SQL: In-Context Learning for Text-to-SQL with Automatically-Generated Chain-of-Thought
      Python
      41831 Updated Jan 11, 2024
    • PsyAgents

      Public
      An Open-Source Psychotherapy Simulation Platform with Interactive Agents
      0000 Updated Jan 4, 2024
    • A curated collection of classic papers for each research direction
      01000 Updated Dec 4, 2023
    • [ICASSP 2024] A BiRGAT Model for Multi-intent Spoken Language Understanding with Hierarchical Semantic Frames
      Python
      MIT License
      0510 Updated Nov 27, 2023
    • [ACL 2023 Findings] CSS: A Large-scale Cross-schema Chinese Text-to-SQL Medical Dataset
      Python
      Apache License 2.0
      0510 Updated May 26, 2023
    • D4

      Public
      [EMNLP 2022] D4: a Chinese Dialogue Dataset for Depression-Diagnosis-Oriented Chat
      CSS
      0100 Updated Mar 13, 2023
    • BER

      Public
      Balanced Error Rate for Speaker Diarization
      Python
      32800 Updated Feb 28, 2023
    • Materials of public talks given by SJTU X-LANCE members
      01400 Updated Dec 3, 2022
    • [EMNLP 2021] The baseline code for WebSRC dataset.
      HTML
      MIT License
      94720 Updated Aug 27, 2022