CLUE1.0总排行榜     CLUE1.1提交规则  |   项目地址
CLUE1.1与CLUE1.0区别:区别与原有的CLUE1.0,CLUE1.1在部分任务启用了新的测试集,训练集和验证集保持不变;CLUE1.0保留CMNLI自然语言推理任务

排行模型研究机构测评时间Score1.0认证AFQMCTNEWS1.0IFLYTEKOCNLI_50KCMNLIWSC1.0CSLCMRC2018CHID1.0C3 1.0
1ShenZhouQQ浏览器实验室(QQ Browser Lab)21-09-1985.881待认证80.5574.1567.6586.3786.4996.5590.9787.8595.5892.65
2HUMANCLUE19-12-0185.610已认证817180.390.376988492.487.196
3Mengzi澜舟科技-创新工场21-09-1484.939待认证81.7975.0665.0882.5786.1396.5589.8783.9596.092.39
4MotianQQ浏览器搜索21-06-2584.056待认证78.373.1865.4684.9785.4494.8390.1785.394.4288.49
5BERTSGSogou Search21-06-2583.824待认证79.8574.1564.5485.9385.395.178983.893.0687.44
6Pangu华为云-循环智能21-04-2383.045待认证78.1172.0765.1983.385.1995.5287.7384.4593.2585.64
7MT-BERTsMeituan NLP21-03-1081.065待认证77.3670.0364.3183.4785.1489.6687.483.289.7980.29
8LICHEE腾讯看点21-01-0880.507待认证76.9770.564.1581.384.5490.6987.479.887.582.22
9roberta_selfrunOPPO小布助手21-09-2980.238待认证77.8869.3763.9280.482.9493.187.2780.190.1177.29
10BERTsBERTs20-12-2480.220待认证76.7769.9463.9282.984.4888.9786.7780.589.5178.44
11UER-ensembleTencentPretrain & TI-ONE20-11-2880.086待认证76.8272.26480.884.0990.3485.8379.1586.0381.6
12Archer-24E-SINGLEsearch-nlp20-12-2479.795待认证77.2669.5462.2783.5785.239085.7375.6585.6683.04
13BI-ALBERTIt's me.21-03-0479.570待认证76.0468.2763.8182.1783.2587.9386.7781.258878.21
14selfrun-ensembleOPPO小布助手20-12-2279.531待认证76.0969.163.9280.482.5691.3887.2778.588.877.29
15Archer-24lsearch-nlp20-11-3079.338待认证77.4469.9662.6982.5784.7887.2485.1774.0585.4184.07
16NEZHA-largeHuawei Noah's Ark lab20-11-1479.289待认证76.5969.3763.6280.9384.2189.3185.2777.986.5379.16
17NvWaConvolutional AI21-05-2779.278待认证76.1266.6161.3181.584.5988.9786.7770.6589.7286.54
18BI-ALBERTIt's me!21-01-2579.206待认证76.0467.8963.8181.7783.0687.9386.779.78877.16
19RoFormerV2 large追一科技22-03-1978.027待认证76.9558.8762.6575.8381.286.2184.9780.587.6885.41
20aotemanaoteman20-12-0377.534待认证75.8368.7562.6577.1382.5485.5283.4378.1586.5774.77

ALBERT(Ensemble)

GitHub/模型网址:

提交日期:9月17日

分数:9月17日

更多详情:

型号说明

阿尔伯特模型集合

参数说明

单任务微调。我们从MNLI为RTE、STS和MRPC优化的模型开始

总参数:-1

共享参数:-1

诊断信息

诊断主混淆矩阵

C N E
C 182 36 40
N 81 189 116
E 17 69 374

C = 对立

N = 不包含

E = 包含

获取排行榜数据成功!