ZeroCLUE零样本学习榜  Github地址 |  提交样例 |  测评方案 |  测评规则  
模型描述需包含关键词"ZeroCLUE.M"; 提交需实名,即:队伍名称、模型名称、Url/Github、模型描述,需有真实有效。无意义的提交将被移除;有问题发邮件:CLUEbenchmark@163.com
 2021-06-18: 零样本学习榜(ZeroCLUE)启用。

排行模型研究机构测评时间ScoreEPRSTMTCSLDCPTNEWSFIFLYTEKFOCNLIFBUSTMCHIDFCSLFCLUEWSCF
1HumanCLUE21-06-1883.93490.068.07166.090.38887.18498.0
2CPM-BeeOpenBMB&面壁智能23-05-2678.18485.5258.9978.258.8177.7383.8589.6583.687.24
3Ctyun_Big_Model天翼云AI23-02-2476.21787.2548.0277.1359.6275.590.0584.682.981.72
4PaddleNLP-UTC飞桨PaddleNLP23-01-1170.54785.9258.9268.2740.1574.7976.782.7570.674.48
5二郎神-UnifiedMCIDEA研究院22-08-3070.29588.7150.1871.6740.5875.580.1584.8560.681.72
6GPT-MoE阿里云机器学习平台PAI22-08-2269.54584.260.2957.7351.3171.4163.988.269.467.59
7Randeng-T5-784M-MultiTaskIDEA研究院22-12-0169.17685.1352.3269.9342.0874.2881.6575.158.384.48
8Mengzi-T5-MT澜舟科技22-08-2268.92686.9955.1974.7322.4274.6977.685.184.1765.17
9asdf1123-04-2867.70582.8735.9159.652.2370.3374.481.270.574.14
10CPM-BeeModelbest23-04-2867.69882.8735.91052.2370.3374.481.270.574.14
11assistantzz23-05-1565.94087.1260.0269.64576.7867.4549.6560.481.03
12XXXX-TestXXXX-Test22-11-1463.80588.7137.8171.2726.3572.6675.0572.971.0365.86
13二郎神-MRCIDEA研究院22-01-2463.51586.1948.6569.4736.0845.5974.0584.6553.5379.31
14abtest03abtest0322-12-2461.53687.5256.7566.439.5867.1823.373.468.9775.52
15bumble-75lpnhtu23-05-0861.51488.3100073.0777.2584.880.488.28
164731473123-05-1461.23687.1260.0269.64553.65349.6560.481.03
17XXXXRobertaXXXXRoberta22-11-1560.36086.4536.31025.1966.7173.1576.466.652.07
18bumble-20plnuht23-05-0860.14188.4500073.6473.781.178.0386.21
19t5_wut5_wu22-11-1159.96077.0338.5869.9342.0871.2173.764.4552.959.66
20UniCUUniCU22-11-0859.25187.5237.8171.426.1272.6674.4556.655.3363.45

ALBERT(Ensemble)

GitHub/模型网址:

提交日期:9月17日

分数:9月17日

更多详情:

型号说明

阿尔伯特模型集合

参数说明

单任务微调。我们从MNLI为RTE、STS和MRPC优化的模型开始

总参数:-1

共享参数:-1

诊断信息

诊断主混淆矩阵

C N E
C 182 36 40
N 81 189 116
E 17 69 374

C = 对立

N = 不包含

E = 包含

获取排行榜数据成功!