当前位置:首页 > 学术交流
“数字+”与统计数据工程系列讲座(二十七)3月29日中国科学院研究员王启华教授来我院讲座预告
发布日期:2023-03-27 阅读:
题目: A robust fusion-extraction procedure with summary statistics in the presence of biased sources

汇报人:王启华

会议时间:2023年3月29日(周三)10:00-11:00

地点:综合楼644会议室

报告人简介:

王启华,中国科学院数学与系统科学研究院研究员,博士生导师,国家杰出青年基金获得者,教育部长江学者奖励计划特聘教授,中科院“百人计划”入选者。曾在北京大学、香港大学任教及在深圳大学与浙江工商大学任特聘教授,先后访问加拿大、美国、德国及澳大利亚10多所世界一流大学。主要从事复杂数据经验似然统计推断、缺失数据分析、高维数据统计分析、大规模数据分析等方面的研究,出版专著三部,在The Annals of Statistics,  JASA及Biometrika等国际重要刊物发表论文140余篇,部分工作已产生持久的学术影响。曾主持国家自然科学基金委国家杰出青年基金项目、重点项目、多项面上项目,作为核心骨干成员先后参加了两项国家自然科学基金创新群体项目。是高维统计分会理事长,生存分析分会副理事长,中国现场统计研究会常务理事,中国概率统计学会常务理事,曾任或现任《中国科学》(中英文版)(2005-2012)、Electronic Research Archive、Ann. Inst. Stat. Math、Biostatistics & Epidemiology及《应用数学学报》英文版等刊物及《现代数学基础丛书》与《统计与数据科学丛书》的编委。

摘要:

Information from multiple data sources is increasingly available. However, some data sources may produce biased estimates due to biased sampling, data corruption, or model misspecification. This calls for robust data combination methods with biased sources.  In this paper, a robust data fusion-extraction method is proposed. In contrast to existing methods, the proposed method can be applied to the important case where researchers have no knowledge of which data sources are unbiased. The proposed estimator is easy to compute and only employs summary statistics, and hence can be applied to many different fields, e.g., meta-analysis, Mendelian randomization, and distributed systems. The proposed estimator is consistent even if many data sources are biased and is asymptotically equivalent to the oracle estimator that only uses unbiased data. Asymptotic normality of the proposed estimator is also established. In contrast to the existing meta-analysis methods, the theoretical properties are guaranteed even if the number of data sources and the dimension of the parameter diverges as the sample size increases. Furthermore, the proposed method provides a consistent selection for unbiased data sources with probability approaching one. Simulation studies demonstrate the efficiency and robustness of the proposed method empirically. The proposed method is applied to a meta-analysis data set to evaluate the surgical treatment for moderate periodontal disease and to a Mendelian randomization data set to study the risk factors of head and neck cancer.
上一条: “数字+百家讲坛” 4月6日施建军教授来我校讲座预告 2023-03-29
下一条: “数字+”与之江统计讲坛(第30讲)3月27日美国肯塔基大学秦菁来我院讲座预告 2023-03-20

友情链接: 浙江工商大学统计学院 |  中国人民大学统计学院 |  厦门大学计划统计系 |  中国统计学会 | 

版权所有 ©2017 浙江工商大学统计学院 All Right Reserver. Email:tjx@zjgsu.edu.cn 技术支持:名冠电子商务
地址:浙江省杭州市下沙高教园区学正街18号 联系电话:(86)571-28008085 浙ICP备15014656号 浙公网安备33011802000512号