Feature Request: GLM-4 9B Support #7778

Closed
@arch-btw

Prerequisites

  • I am running the latest code. Mention the version if possible as well.
  • I carefully followed the README.md.
  • I searched using keywords relevant to my issue to make sure that I am creating a new issue that is not already open (or closed).
  • I reviewed the Discussions, and have a new and useful enhancement to share.

Feature Description

It would be great to have support for the GLM-4 9B models, which were released today. They post very impressive benchmark results. I've also been trying the model out in Hugging Face Spaces and noticed that it speaks many languages fluently and is knowledgeable on many topics. Thank you for your time.

Here are the download links:

Here is the English README: README_en.md

Motivation

The motivation for this feature comes from some of the technical highlights of this model family:

  • These models were trained on 10T tokens.
  • GLM-4-9B-Chat models have 9B parameters.
  • The GLM-4-9B-Chat-1M model supports a 1M context length and scored 100% on the needle-in-a-haystack challenge.
  • GLM-4-9B models support 26 languages.
  • The family includes a vision model (glm-4v-9b).
  • Early impressions are very positive.

Here are some of the results:

Needle challenge:

[Image: eval_needle]

Longbench:

[Image: longbench]

Possible Implementation

We might be able to reuse some of the code from #6999.

There is also chatglm.cpp, but it doesn't support GLM-4.
