ntry-header

 

专利名称:基于统计词典模型的未登录词发现和分词系统及方法
功能:可在没有训练语料库和词库未知的条件下通过无指导文本分析实现中文词汇识别和分词
发明人:邓柯、刘军
专利号:201410299453.9
授权公告日:2017 年 6 月 9 日

 

专利名称:搜索引擎专利《信息检索方法和装置》
功能:根据病历中语意精确寻找符合查询语意的电子病历
发明人:俞声
专利号:ZL 201310200430.3
授权公告日:2016 年 6 月 1 日

#post-11995
ntry-header

 

2017年8月,刘军教授荣获“Jerome Sacks 跨学科研究奖”

2017年8月,林希虹教授荣获“F.N. David奖”

2017年12月27日,林乾教授论文荣获“ICCM Best Paper Award——若琳奖”

2018年1月28日,邓柯教授荣获“2017年度考核校级优秀奖”

2018年3月17-18日,俞声教授团队荣获“解放军总医院急救大数据Datathon”冠军

2016年12月19-22日,刘军教授荣获第十届泛华统计协会“许宝騄奖”(Pao-Lu Hsu Award)

2017年5月,杨立坚教授当选国际数理统计学会会士(IMS Elected Fellow)

2017年6月23日,邓柯副教授荣获“科学中国人2016年度人物”

2015 年 11 月,邓柯教授受邀于中国数学会第十二届全国会议做统计组特邀报告

#post-11994
ntry-header
2017.7-2018.6
  • Zhang, Y. and Yang, L. ,2018. A smooth simultaneous confidence band for correlation curve. TEST27(2),247-269.
  • Zhang, R., Deng, W., Zhu, Y. ,2017. Using Deep Neural Networks to Automate Large Scale Statistical Analysis for Big Data Applications. Proceedings of the 9th Asian Conference on Machine Learning (ACML17), Seoul, Korea, 2017.
  • Pan, C. and Zhu, M. ,2017. Group Additive Structure Identification for Kernel Nonparametric Regression. Advances in Neural Information Processing Systems 30 (NIPS 2017).
  • Huang, Q. and Zhu, Y. 2017. SPOT: Sparse Optimal Transformations for High Dimensional Variable Selection and Exploratory Regression Analysis. Proceedings of the 23rd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD 2017).
  • Cheng, L., Zeng, P., and Zhu, Y. (2017) BS-SIM: An Effective Variable Selection Method for High-dimensional Single Index Model. Electronic Journal of Statistics, 11(2) 3522-3548.
  • 邓柯 (2017) 统计学与人文研究的哲学思辨.《公共管理评论》, 2017年第3期(总第26期), 24-38.
  • Sun Z. Wang T., Deng K., Wang X.F., Lafyatis R., Ding Y., Hu M., Chen W. (2018) DIMM-SC: a Dirichlet Mixture Model for Clustering Droplet-Based Single Cell Transcriptomic Data. Bioinformatics 34(1), 139-146.
  • Li, D., Zhang, X.F., Zhu, K. and Ling, S. (2018) The ZD-GARCH model: A new way to study heteroscedasticity. Journal of Econometrics 202, 1-17.
  • Liu, F., Li, D.* and Kang, X.M. (2018) Sample path properties of an explosive double autoregressive model. Econometric Reviews 37, 484-490.
  • Hou, L., Sun, N., Mane, S., Sayward, F., 2017. Impact of Genotyping Errors on Statistical Power of Association Test in Genomic Analyses: A Case Study. Genetic Epidemiology , 41, pp.152-162.
  • Williams, K.R., Colangelo, C.M., Hou, L. and Chung, L., 2017. Use of a Targeted Urine Proteome Assay (TUPA) to identify protein biomarkers of delayed recovery after kidney transplant. Proteomics Clin Appl 11, pp.7-8.
  • Can,A., Castro, V.M., Ozdemir, Y.H., Dagen, D., Dligach, D., Finan, S., Yu,S., Gainer,V., Shadick, N.A., Murphy, S., Cai, T.C., Savova, G., Weiss, S.T., Du, R.*,2018. Alcohol Consumption and Aneurysmal Subarachnoid Hemorrhage. Translational Stroke Research 9(1), pp.13-19.
  • Can,A., Castro, V.M., Yu, S., Dligach, D., Finan, S., Gainer, V., Shadick, N.A., Savova, G., Murphy, S., Cai, T., Weiss, s.t. and Du, R*. 2018. Antihyperglycemic Agents are Inversely Associated with Intracranial Aneurysm Rupture. Stroke 49(1), 34-39.
  • Yu, S., Ma, Y., Gronsbell, J., Cai, T., Ananthakrishnan, A.N., Gainer, V.S., Churchill, S.E., Szolovits, P., Murphy, S.N. and Kohane, I.S., 2017. Enabling phenotypic big data with PheNorm. Journal of the American Medical Informatics Association 25(1), 54-60.
  • Can,A., Castro, V.M., Ozdemir, Y.H., Dagen, S., Yu, S., Dligach, D., Finan, S., Gainer, V., Shadick, N.A. and Murphy, S., 2017, Association of Intracranial Aneurysm Rupture with Smoking Duration, Intensity, and Cessation. Neurology 89(13),1408-1415.
  • McCoy Jr, TH., Yu, S., Hart, K.L., Castro, V.M., Brown, H.E., Rosenquist, J.N., Doyle, A.E., Vuijk, P.J., Cai, T. and Perlis, R.H., 2018. High Throughput Phenotyping for Dimensional Psychopathology in Electronic Health Records. Biological Psychiatry (2018), 83(12), 997-1004.
  • McCoy Jr, TH., Castro, V.M., Hart, K.L., Pellegrini, A.M., Yu,S., Cai, T. and Perlis, R.H.,2018. Genome-wide Association Study of Dimensional Psychopathology Using Electronic Health Records. Biological Psychiatry, 83(12), 1005-1011.
  • Liu, H., and Yu,B., 2017. Comments on: High dimensional simultaneous inference with the bootstrap. Test 26(4), 740-750.
  • Lin, Q., Zhao, Z., and Liu, J., 2018. On consistency and sparsity of sliced inverse regression in high dimensions. Annals of Statistics 46(2), 580-610.

 

2016.8-2017.7
  • Shao, Q. and Yang, L. (2017) Oracally efficient estimation and consistent model selection for auto-regressive moving average time series with trend. Journal of the Royal Statistical Society Series B 79(2), 507-524.
  • Zheng, S., Liu, R., Yang, L. and Härdle, W. (2016) Statistical inference for generalized additive models: simultaneous confidence corridors and variable selection. TEST 25(4), 607-626.
  • Wang, J., Wang, S., and Yang, L. (2016) Simultaneous confidence bands for the distribution function of a finite population and its superpopulation. TEST25(4), 692-709.
  • Li, D. and Tong, H. (2016) Nested sub-sample search algorithm for estimation of threshold models. Statistica Sinica 26(4), 1543-1554.
  • Hou L., Sun N., Mane S., et al. (2016) Impact of genotyping errors on statistical power of association tests in genomic analyses: A case study. Genetic Epidemiology 41(2):152-162.
  • Yong F.H., Tian L., Yu S., Cai T. and Wei L.J. (2016) Optimal stratification in outcome prediction using baseline information; Biometrika, 103.4: 817-828.
  • Castro V.M., Dligach D., Finan S., Yu S., Can A., Abd-El-Barr M., Gainer V.S., Shadick N.A., Murphy S.N., Cai T., Savova G., Weiss S.T., Du R. (2017) Large-scale identification of subjects with cerebral aneurysms using natural language processing. Neurology 88(2),164-168.
  • Yu S., Chakrabortty A., Liao K.P., Cai T., Ananthakrishnan A.N., Gainer V.S., Churchill S.E., Szolovits P., Murphy S.N., Kohane I.S., Cai T. (2017) Surrogate-assisted Feature Extraction for High-throughput Phenotyping. Journal of the American Medical Informatics Association 24(el), e143-e149

 

2015.7-2016.7
  • Shao Q. and Yang L. (2016) Oracally effcient estimation and consistent model selection for auto-regressive moving average time series with trend. Journal of the Royal Statistical Society Series B. DOI: 10.1111/rssb.12170.
  • Wang J., Wang S., and Yang L. (2016) Simultaneous confdence bands for the distribution function of a fnite population and its superpopulation. TEST 25(4), 692-709
  • Zheng S., Liu R., Yang L. and Härdle W. (2016) Statistical inference for generalized additive models: simultaneous confdence corridors and variable selection. TEST  25(4), 607-626
  • Yang M., Xue L. and Yang L. (2016) Variable selection for additive model via cumulative ratios of empirical strengths total. Journal of Nonparametric Statistics 28(3), 595-616.
  • Wu H. and Zhu Y. (2016) Deconvolution of base pair level RNA-Seq read counts for quantification of transcript expression levels. Annals of Applied Statistics. (To Appear)
  • 邓柯,陈孟裕,金锋,焦阳,丛林晔,罗季阳,殷杰(2016)中国进口食品风险评估的统计学方法。 《数理统计与管理》 ,已接收。
  • Deng K., Bol P.K., Li K.J., and Liu J.S. (2016) On unsupervised Chinese text mining. Online published in Proceedings of the National Academy of Sciences of USA. DOI: 10.1073/pnas.1516510113.
  • Zang C, Wang T., Deng K., et al (2016) High-dimensional genomic data bias correction and data integration using MANCIE. Online published in Nature Communications. DOI: 10.1038/ncomms11305.
  • Deng K., Li Y., Zhu W., and Liu J.S. (2016) Fast parameter estimation in loss tomography for networks of general topology. Online published in Annals of Applied Statistics. DOI: 10.1214/15-AOAS883
  • Li D., Ling S. and Zakoïan J.M. (2015) Asymptotic inference in multiple-threshold double autoregressive models. Journal of Econometrics 189, 415-427.
  • Li D., Ling S., and Zhang R.M. (2016) On a threshold double autoregressive model. Journal of Business & Economic Statistics 34, 68-80.
  • Li D., and Tong H. (2016) Nested sub-sample search algorithm for estimation of threshold models. Statistica Sinica. 26,4, 1543-1554.
  • Liu F., Li D., and Kang X.M. (2016) Sample path properties of an explosive double autoregressive model.Econometric Reviews. (To Appear)
  • Evans B., Gloria-Soria A., Hou L., McBride C., Bonizzoni M., Zhao H., Powell J. (2015) A multipurpose,high-throughput single-nucleotide polymorphism chip for the Dengue and Yellow Fever Mosquito, Aedes aegyptiG3, 3(5): 711-718.
  • Castro V., Shen Y., Yu S., Finan S., Pau C.T., Gainer V., Keefe C.C., Savova G., Murphy S.N., Cai T., Welt CK.(2015) Identifcation of subjects with polycystic ovary syndrome using electronic health records. Reproductive Biology and Endocrinology 13(1):1.
  • Cai T., Giannopoulos A.A., Yu S., Kelil T., Ripley B, Kumamaru K.K., Rybicki F.J., and Mitsouras D.*. (2016) Natural Language Processing Technologies in Radiology Research and Clinical Applications. RadioGraphics, 36(1): 176-191.
#post-11993
ntry-header

#post-11992
ntry-header

#post-11991
ntry-header

 

在大数据迅猛发展的时代背景之下,各行各业对统计学和数据科学专业人才的需求不断增加。清华大学统计学研究中心于2018年秋季学期开展第二期统计与数据科学研修班,为有志于从事数据处理、挖掘、分析等工作的人士提供方法和技术培训,同时也为有意在相关领域继续深造的人士奠定坚实的理论和应用基础。研修班项目信息在业界一经发布,反响极其热烈,报名、咨询的人员达百余人次。经过多轮面试和层层筛选,最终确定了三十名优秀学员在清华大学进行为期一年的课程的学习。

 

中心 李东教授
班主任 邓婉璐老师
中心 俞声教授

 

2018年9月20日,清华大学统计与数据科学研修班在清华大学舜德楼412会议室顺利开班。由中心李东教授主持并介绍清华大学统计学研究中心整体情况,班主任邓婉璐老师为大家详细讲解了在校研修期间的各项事宜,内容详尽周到,涵盖了在清华学习和生活的方方面面,随后中心俞声教授主讲了本期研修班的第一次课程《统计计算》,为同学们敲开了统计科学的大门。

 

研修班班会现场
#post-11990
ntry-header

2018年9月17日,统计学论坛在清华大学伟清楼209成功举办。受中心杨立坚教授邀请,中国人民大学统计与大数据研究院院长艾春荣教授访问我中心并作主题为“A Unified Framework for Efficient Estimation of General Treatment Models”的特邀报告。本次学术报告由俞声教授主持。

艾春荣教授,华中科技大学应用数学硕士,美国麻省理工学院经济学博士,现任中国人民大学统计与大数据研究院院长。艾教授长期从事计量经济学理论与方法、实证产业经济、实证金融、中国经济的教学和科研工作,主持或主持过国家自然基金面上项目3项,参与国家自科基金重点项目1项,在国际主要经济学期刊上发表论文四十余篇。

#post-11989
ntry-header

#post-11988
ntry-header

2018年9月10日,时值教师节之际,【统计学论坛·特邀报告】在清华大学伟清楼209成功举办。报告邀请到来自美国宾夕法尼亚大学的Qi Long教授。这次报告由清华大学统计学研究中心俞声老师主持,题目是“Variable Selection for Structured High-dimensional Data Using Known and Novel Graph Information”。

#post-11987
ntry-header

#post-11986