Semi-Supervised Deep Sobolev Regression: Estimation and Variable Selection by ReQU Neural Network
Abstract: We propose SDORE, a Semi-supervised Deep Sobolev Regressor, for the nonparametric estimation of the underlying regression function and its gradient. SDORE employs deep ReQU neural networks to minimize the empirical risk with gradient norm regularization, allowing the approximation of the regularization term by unlabeled data. Our study includes a thorough analysis of the convergence rates of SDORE in $L{2}$-norm, achieving the minimax optimality. Further, we establish a convergence rate for the associated plug-in gradient estimator, even in the presence of significant domain shift. These theoretical findings offer valuable insights for selecting regularization parameters and determining the size of the neural network, while showcasing the provable advantage of leveraging unlabeled data in semi-supervised learning. To the best of our knowledge, SDORE is the first provable neural network-based approach that simultaneously estimates the regression function and its gradient, with diverse applications such as nonparametric variable selection. The effectiveness of SDORE is validated through an extensive range of numerical simulations.
- {barticle}[author] \bauthor\bsnmAmiri, \bfnmAboubacar\binitsA. and \bauthor\bsnmThiam, \bfnmBaba\binitsB. (\byear2018). \btitleRegression estimation by local polynomial fitting for multivariate data streams. \bjournalStatistical Papers \bvolume59 \bpages813–843. \bdoihttps://doi.org/10.1007/s00362-016-0791-6 \endbibitem
- {barticle}[author] \bauthor\bsnmAtmadja, \bfnmJuliana\binitsJ. and \bauthor\bsnmBagtzoglou, \bfnmAmvrossios C.\binitsA. C. (\byear2001). \btitleState of the art report on mathematical methods for groundwater pollution source identification. \bjournalEnvironmental Forensics \bvolume2 \bpages205-214. \bdoihttps://doi.org/10.1006/enfo.2001.0055 \endbibitem
- {barticle}[author] \bauthor\bsnmBadia, \bfnmA. El\binitsA. E. and \bauthor\bsnmHa-Duong, \bfnmT.\binitsT. (\byear2002). \btitleOn an inverse source problem for the heat equation. Application to a pollution detection problem. \bjournalJournal of Inverse and Ill-posed Problems \bvolume10 \bpages585–599. \bdoidoi:10.1515/jiip.2002.10.6.585 \endbibitem
- {barticle}[author] \bauthor\bsnmBadia, \bfnmA. El\binitsA. E. and \bauthor\bsnmHajj, \bfnmA. El\binitsA. E. (\byear2013). \btitleIdentification of dislocations in materials from boundary measurements. \bjournalSIAM Journal on Applied Mathematics \bvolume73 \bpages84-103. \bdoi10.1137/110833920 \endbibitem
- {barticle}[author] \bauthor\bsnmBagby, \bfnmThomas\binitsT., \bauthor\bsnmBos, \bfnmLen P.\binitsL. P. and \bauthor\bsnmLevenberg, \bfnmNorman\binitsN. (\byear2002). \btitleMultivariate simultaneous approximation. \bjournalConstructive Approximation \bvolume18 \bpages569-577. \endbibitem
- {barticle}[author] \bauthor\bsnmBanerjee, \bfnmSudipto\binitsS., \bauthor\bsnmGelfand, \bfnmAlan E\binitsA. E. and \bauthor\bsnmSirmans, \bfnmC. F\binitsC. F. (\byear2003). \btitleDirectional rates of change under spatial process models. \bjournalJournal of the American Statistical Association \bvolume98 \bpages946-954. \bdoi10.1198/C16214503000000909 \endbibitem
- {barticle}[author] \bauthor\bsnmBauer, \bfnmBenedikt\binitsB. and \bauthor\bsnmKohler, \bfnmMichael\binitsM. (\byear2019). \btitleOn Deep Learning as a Remedy for the Curse of Dimensionality in Nonparametric Regression. \bjournalThe Annals of Statistics \bvolume47 \bpages2261–2285. \endbibitem
- {barticle}[author] \bauthor\bsnmBelkin, \bfnmMikhail\binitsM., \bauthor\bsnmNiyogi, \bfnmPartha\binitsP. and \bauthor\bsnmSindhwani, \bfnmVikas\binitsV. (\byear2006). \btitleManifold regularization: A geometric framework for learning from labeled and unlabeled examples. \bjournalJournal of Machine Learning Research \bvolume7 \bpages2399-2434. \endbibitem
- {bbook}[author] \bauthor\bsnmBellman, \bfnmRichard E.\binitsR. E. (\byear1961). \btitleAdaptive Control Processes: A Guided Tour. \bpublisherPrinceton University Press, \baddressPrinceton. \bdoidoi:10.1515/9781400874668 \endbibitem
- {barticle}[author] \bauthor\bsnmChen, \bfnmZhiming\binitsZ., \bauthor\bsnmTuo, \bfnmRui\binitsR. and \bauthor\bsnmZhang, \bfnmWenlong\binitsW. (\byear2018). \btitleStochastic convergence of a nonconforming finite element method for the thin plate spline smoother for observational data. \bjournalSIAM Journal on Numerical Analysis \bvolume56 \bpages635-659. \bdoi10.1137/16M109630X \endbibitem
- {barticle}[author] \bauthor\bsnmChen, \bfnmZhiming\binitsZ., \bauthor\bsnmZhang, \bfnmWenlong\binitsW. and \bauthor\bsnmZou, \bfnmJun\binitsJ. (\byear2022). \btitleStochastic convergence of regularized solutions and their finite element approximations to inverse source problems. \bjournalSIAM Journal on Numerical Analysis \bvolume60 \bpages751-780. \bdoi10.1137/21M1409779 \endbibitem
- {barticle}[author] \bauthor\bsnmDai, \bfnmXiongtao\binitsX., \bauthor\bsnmMüller, \bfnmHans-Georg\binitsH.-G. and \bauthor\bsnmTao, \bfnmWenwen\binitsW. (\byear2018). \btitleDerivative principal component analysis for representing the time dynamics of longitudinal and functional data. \bjournalStatistica Sinica \bvolume28 \bpages1583–1609. \endbibitem
- {bbook}[author] \bauthor\bsnmDeVore, \bfnmRonald A.\binitsR. A. and \bauthor\bsnmLorentz, \bfnmGeorge G.\binitsG. G. (\byear1993). \btitleConstructive Approximation. \bseriesGrundlehren der mathematischen Wissenschaften \bvolume303. \bpublisherSpringer Berlin, Heidelberg. \endbibitem
- {binproceedings}[author] \bauthor\bsnmDrucker, \bfnmHarris\binitsH. and \bauthor\bsnmLe Cun, \bfnmYann\binitsY. (\byear1991). \btitleDouble backpropagation increasing generalization performance. In \bbooktitleIJCNN-91-Seattle International Joint Conference on Neural Networks \bvolumeii \bpages145-150 vol.2. \bdoi10.1109/IJCNN.1991.155328 \endbibitem
- {barticle}[author] \bauthor\bsnmDrucker, \bfnmHarris\binitsH. and \bauthor\bsnmLe Cun, \bfnmYann\binitsY. (\byear1992). \btitleImproving generalization performance using double backpropagation. \bjournalIEEE Transactions on Neural Networks \bvolume3 \bpages991-997. \bdoi10.1109/72.165600 \endbibitem
- {barticle}[author] \bauthor\bsnmEpperson, \bfnmJames F.\binitsJ. F. (\byear1987). \btitleOn the Runge example. \bjournalThe American Mathematical Monthly \bvolume94 \bpages329-341. \bdoi10.1080/00029890.1987.12000642 \endbibitem
- {bbook}[author] \bauthor\bsnmEvans, \bfnmLawrence C.\binitsL. C. (\byear2010). \btitlePartial differential equations, \beditionSecond ed. \bseriesGraduate Studies in Mathematics \bvolume19. \bpublisherAmerican Mathematical Society (AMS). \endbibitem
- {barticle}[author] \bauthor\bsnmFarrell, \bfnmMax H\binitsM. H., \bauthor\bsnmLiang, \bfnmTengyuan\binitsT. and \bauthor\bsnmMisra, \bfnmSanjog\binitsS. (\byear2021). \btitleDeep Neural Networks for Estimation and Inference. \bjournalEconometrica \bvolume89 \bpages181–213. \endbibitem
- {barticle}[author] \bauthor\bsnmFlorens, \bfnmJean-Pierre\binitsJ.-P., \bauthor\bsnmIvaldi, \bfnmMarc\binitsM. and \bauthor\bsnmLarribeau, \bfnmSophie\binitsS. (\byear1996). \btitleSobolev Estimation of Approximate Regressions. \bjournalEconometric Theory \bvolume12 \bpages753–772. \bdoi10.1017/S0266466600007143 \endbibitem
- {bbook}[author] \bauthor\bsnmGoodfellow, \bfnmIan\binitsI., \bauthor\bsnmBengio, \bfnmYoshua\binitsY. and \bauthor\bsnmCourville, \bfnmAaron\binitsA. (\byear2016). \btitleDeep Learning. \bpublisherMIT Press. \endbibitem
- {bbook}[author] \bauthor\bsnmGriewank, \bfnmAndreas\binitsA. and \bauthor\bsnmWalther, \bfnmAndrea\binitsA. (\byear2008). \btitleEvaluating derivatives: Principles and techniques of algorithmic differentiation, \beditionSecond ed. \bseriesOther Titles in Applied Mathematics. \bpublisherSociety for Industrial and Applied Mathematics (SIAM). \bdoi10.1137/1.9780898717761 \endbibitem
- {bbook}[author] \bauthor\bsnmGrisvard, \bfnmPierre\binitsP. (\byear2011). \btitleElliptic Problems in Nonsmooth Domains. \bpublisherSociety for Industrial and Applied Mathematics (SIAM). \bdoi10.1137/1.9781611972030 \endbibitem
- {barticle}[author] \bauthor\bsnmPeter Hall and Adonis Yatchew (\byear2007). \btitleNonparametric estimation when data on derivatives are available. \bjournalThe Annals of Statistics \bvolume35 \bpages300–323. \endbibitem
- {barticle}[author] \bauthor\bsnmHall, \bfnmPeter\binitsP. and \bauthor\bsnmYatchew, \bfnmAdonis\binitsA. (\byear2010). \btitleNonparametric least squares estimation in derivative families. \bjournalJournal of Econometrics \bvolume157 \bpages362-374. \bdoihttps://doi.org/10.1016/j.jeconom.2010.03.038 \endbibitem
- {barticle}[author] \bauthor\bsnmHashem, \bfnmSherif\binitsS. (\byear1997). \btitleOptimal linear combinations of neural networks. \bjournalNeural Networks \bvolume10 \bpages599-614. \bdoihttps://doi.org/10.1016/S0893-6080(96)00098-6 \endbibitem
- {barticle}[author] \bauthor\bsnmHeckman, \bfnmNancy E.\binitsN. E. and \bauthor\bsnmRamsay, \bfnmJames O.\binitsJ. O. (\byear2000). \btitlePenalized regression with model-based penalties. \bjournalCanadian Journal of Statistics \bvolume28 \bpages241-258. \bdoihttps://doi.org/10.2307/3315976 \endbibitem
- {bbook}[author] \bauthor\bsnmIsakov, \bfnmVictor\binitsV. (\byear2018). \btitleInverse problems for partial differential equations, \beditionthird ed. \bseriesApplied Mathematical Sciences \bvolume127. \bpublisherSpringer Cham. \bdoihttps://doi.org/10.1007/978-3-319-51658-5 \endbibitem
- {barticle}[author] \bauthor\bsnmJiao, \bfnmYuling\binitsY., \bauthor\bsnmWang, \bfnmYang\binitsY. and \bauthor\bsnmYang, \bfnmYunfei\binitsY. (\byear2023). \btitleApproximation bounds for norm constrained neural networks with applications to regression and GANs. \bjournalApplied and Computational Harmonic Analysis \bvolume65 \bpages249-278. \bdoihttps://doi.org/10.1016/j.acha.2023.03.004 \endbibitem
- {barticle}[author] \bauthor\bsnmKohler, \bfnmMichael\binitsM. and \bauthor\bsnmKrzyżak, \bfnmAdam\binitsA. (\byear2001). \btitleNonparametric regression estimation using penalized least squares. \bjournalIEEE Transactions on Information Theory \bvolume47 \bpages3054-3058. \bdoi10.1109/18.998089 \endbibitem
- {barticle}[author] \bauthor\bsnmKohler, \bfnmMichael\binitsM., \bauthor\bsnmKrzyżak, \bfnmAdam\binitsA. and \bauthor\bsnmSchäfer, \bfnmDominik\binitsD. (\byear2002). \btitleApplication of structural risk minimization to multivariate smoothing spline regression estimates. \bjournalBernoulli \bvolume8 \bpages475 – 489. \endbibitem
- {barticle}[author] \bauthor\bsnmKohler, \bfnmMichael\binitsM., \bauthor\bsnmKrzyżak, \bfnmAdam\binitsA. and \bauthor\bsnmLanger, \bfnmSophie\binitsS. (\byear2022). \btitleEstimation of a Function of Low Local Dimensionality by Deep Neural Networks. \bjournalIEEE transactions on information theory \bvolume68 \bpages4032–4042. \endbibitem
- {barticle}[author] \bauthor\bsnmKohler, \bfnmMichael\binitsM. and \bauthor\bsnmLanger, \bfnmSophie\binitsS. (\byear2021). \btitleOn the Rate of Convergence of Fully Connected Deep Neural Network Regression Estimates. \bjournalThe Annals of Statistics \bvolume49 \bpages2231–2249. \endbibitem
- {barticle}[author] \bauthor\bsnmLi, \bfnmBo\binitsB., \bauthor\bsnmTang, \bfnmShanshan\binitsS. and \bauthor\bsnmYu, \bfnmHaijun\binitsH. (\byear2019). \btitleBetter approximations of high dimensional smooth functions by deep neural networks with rectified power units. \bjournalCommunications in Computational Physics \bvolume27 \bpages379–411. \bdoihttps://doi.org/10.4208/cicp.OA-2019-0168 \endbibitem
- {barticle}[author] \bauthor\bsnmLi, \bfnmBo\binitsB., \bauthor\bsnmTang, \bfnmShanshan\binitsS. and \bauthor\bsnmYu, \bfnmHaijun\binitsH. (\byear2020). \btitlePowerNet: Efficient representations of polynomials and smooth functions by deep neural networks with rectified power units. \bjournalJournal of Mathematical Study \bvolume53 \bpages159–191. \bdoihttps://doi.org/10.4208/jms.v53n2.20.03 \endbibitem
- {barticle}[author] \bauthor\bsnmLiu, \bfnmYu\binitsY. and \bauthor\bsnmBrabanter, \bfnmKris De\binitsK. D. (\byear2020). \btitleSmoothed nonparametric derivative estimation using weighted difference quotients. \bjournalJournal of Machine Learning Research \bvolume21 \bpages1-45. \endbibitem
- {barticle}[author] \bauthor\bsnmLiu, \bfnmZejian\binitsZ. and \bauthor\bsnmLi, \bfnmMeng\binitsM. (\byear2023). \btitleOn the estimation of derivatives using plug-in kernel ridge regression estimators. \bjournalJournal of Machine Learning Research \bvolume24 \bpages1–37. \endbibitem
- {bmisc}[author] \bauthor\bsnmLiu, \bfnmRuiqi\binitsR., \bauthor\bsnmLi, \bfnmKexuan\binitsK. and \bauthor\bsnmLi, \bfnmMeng\binitsM. (\byear2023). \btitleEstimation and hypothesis testing of derivatives in smoothing spline ANOVA models. \endbibitem
- {barticle}[author] \bauthor\bsnmLivne, \bfnmIlan\binitsI., \bauthor\bsnmAzriel, \bfnmDavid\binitsD. and \bauthor\bsnmGoldberg, \bfnmYair\binitsY. (\byear2022). \btitleImproved estimators for semi-supervised high-dimensional regression model. \bjournalElectronic Journal of Statistics \bvolume16 \bpages5437 – 5487. \bdoi10.1214/22-EJS2070 \endbibitem
- {barticle}[author] \bauthor\bsnmMasry, \bfnmElias\binitsE. (\byear1996a). \btitleMultivariate local polynomial regression for time series: Uniform strong consistency and rates. \bjournalJournal of Time Series Analysis \bvolume17 \bpages571-599. \bdoihttps://doi.org/10.1111/j.1467-9892.1996.tb00294.x \endbibitem
- {barticle}[author] \bauthor\bsnmMasry, \bfnmElias\binitsE. (\byear1996b). \btitleMultivariate regression estimation local polynomial fitting for time series. \bjournalStochastic Processes and their Applications \bvolume65 \bpages81-101. \bdoihttps://doi.org/10.1016/S0304-4149(96)00095-6 \endbibitem
- {barticle}[author] \bauthor\bsnmMendelson, \bfnmShahar\binitsS. (\byear2001). \btitleOn the size of convex hulls of small sets. \bjournalJournal of Machine Learning Research \bpages1–18. \endbibitem
- {barticle}[author] \bauthor\bsnmBarati Moghaddam, \bfnmMaryam\binitsM., \bauthor\bsnmMazaheri, \bfnmMehdi\binitsM. and \bauthor\bsnmMohammad Vali Samani, \bfnmJamal\binitsJ. (\byear2021). \btitleInverse modeling of contaminant transport for pollution source identification in surface and groundwaters: a review. \bjournalGroundwater for Sustainable Development \bvolume15 \bpages100651. \bdoihttps://doi.org/10.1016/j.gsd.2021.100651 \endbibitem
- {barticle}[author] \bauthor\bsnmMüller, \bfnmHans-Georg\binitsH.-G., \bauthor\bsnmStadtmüller, \bfnmU.\binitsU. and \bauthor\bsnmSchmitt, \bfnmThomas\binitsT. (\byear1987). \btitleBandwidth choice and confidence intervals for derivatives of noisy data. \bjournalBiometrika \bvolume74 \bpages743–749. \endbibitem
- {barticle}[author] \bauthor\bsnmMüller, \bfnmHans-Georg\binitsH.-G. and \bauthor\bsnmYao, \bfnmFang\binitsF. (\byear2010). \btitleAdditive modelling of functional gradients. \bjournalBiometrika \bvolume97 \bpages791-805. \bdoi10.1093/biomet/asq056 \endbibitem
- {barticle}[author] \bauthor\bsnmNakada, \bfnmRyumei\binitsR. and \bauthor\bsnmImaizumi, \bfnmMasaaki\binitsM. (\byear2020). \btitleAdaptive approximation and generalization of deep neural network with intrinsic dimensionality. \bjournalJournal of Machine Learning Research \bvolume21 \bpages1-38. \endbibitem
- {barticle}[author] \bauthor\bsnmNeidinger, \bfnmRichard D.\binitsR. D. (\byear2010). \btitleIntroduction to automatic differentiation and MATLAB object-oriented programming. \bjournalSIAM Review \bvolume52 \bpages545-563. \bdoi10.1137/080743627 \endbibitem
- {binproceedings}[author] \bauthor\bsnmNewell, \bfnmJohn\binitsJ. and \bauthor\bsnmEinbeck, \bfnmJochen\binitsJ. (\byear2007). \btitleA comparative study of nonparametric derivative estimators. In \bbooktitle22nd International Workshop on Statistical Modelling. \bseriesProceedings of the IWSM \bpages453-456. \endbibitem
- {barticle}[author] \bauthor\bsnmNickl, \bfnmRichard\binitsR., \bauthor\bparticlevan de \bsnmGeer, \bfnmSara\binitsS. and \bauthor\bsnmWang, \bfnmSven\binitsS. (\byear2020). \btitleConvergence rates for penalized least squares estimators in PDE constrained regression problems. \bjournalSIAM/ASA Journal on Uncertainty Quantification \bvolume8 \bpages374-413. \bdoi10.1137/18M1236137 \endbibitem
- {barticle}[author] \bauthor\bsnmOrorbia II, \bfnmAlexander G.\binitsA. G., \bauthor\bsnmKifer, \bfnmDaniel\binitsD. and \bauthor\bsnmGiles, \bfnmC. Lee\binitsC. L. (\byear2017). \btitleUnifying adversarial training algorithms with data gradient regularization. \bjournalNeural Computation \bvolume29 \bpages867-887. \bdoi10.1162/NECO_a_00928 \endbibitem
- {barticle}[author] \bauthor\bsnmPetersen, \bfnmPhilipp\binitsP. and \bauthor\bsnmVoigtlaender, \bfnmFelix\binitsF. (\byear2018). \btitleOptimal Approximation of Piecewise Smooth Functions using Deep ReLU Neural Networks. \bjournalNeural Networks \bvolume108 \bpages296–330. \endbibitem
- {bbook}[author] \bauthor\bsnmRamsay, \bfnmJames O.\binitsJ. O. and \bauthor\bsnmSilverman, \bfnmBernard W.\binitsB. W. (\byear2002). \btitleApplied Functional Data Analysis: Methods and Case Studies. \bseriesSpringer Series in Statistics (SSS). \bpublisherSpringer New York, NY. \bdoihttps://doi.org/10.1007/b98886 \endbibitem
- {barticle}[author] \bauthor\bsnmRondonotti, \bfnmVitaliana\binitsV., \bauthor\bsnmMarron, \bfnmJ. S.\binitsJ. S. and \bauthor\bsnmPark, \bfnmCheolwoo\binitsC. (\byear2007). \btitleSiZer for time series: A new approach to the analysis of trends. \bjournalElectronic Journal of Statistics \bvolume1 \bpages268 – 289. \bdoi10.1214/07-EJS006 \endbibitem
- {barticle}[author] \bauthor\bsnmSchmidt-Hieber, \bfnmJohannes\binitsJ. (\byear2020). \btitleNonparametric Regression using Deep Neural Networks with ReLU Activation Function. \bjournalThe Annals of Statistics \bvolume48 \bpages1875–1897. \endbibitem
- {barticle}[author] \bauthor\bsnmShen, \bfnmZuowei\binitsZ. (\byear2020). \btitleDeep Network Approximation Characterized by Number of Neurons. \bjournalCommunications in Computational Physics \bvolume28 \bpages1768–1811. \endbibitem
- {barticle}[author] \bauthor\bsnmShen, \bfnmZuowei\binitsZ., \bauthor\bsnmYang, \bfnmHaizhao\binitsH. and \bauthor\bsnmZhang, \bfnmShijun\binitsS. (\byear2019). \btitleNonlinear Approximation via Compositions. \bjournalNeural Networks \bvolume119 \bpages74–84. \endbibitem
- {bbook}[author] \bauthor\bsnmShephard, \bfnmRonald W.\binitsR. W. (\byear1981). \btitleCost and Production Functions. \bseriesLecture Notes in Economics and Mathematical Systems \bvolume194. \bpublisherSpringer Berlin, Heidelberg. \bdoihttps://doi.org/10.1007/978-3-642-51578-1 \endbibitem
- {barticle}[author] \bauthor\bsnmSong, \bfnmShanshan\binitsS., \bauthor\bsnmLin, \bfnmYuanyuan\binitsY. and \bauthor\bsnmZhou, \bfnmYong\binitsY. (\byear2023). \btitleA general M-estimation theory in semi-supervised framework. \bjournalJournal of the American Statistical Association \bvolume0 \bpages1-11. \bdoi10.1080/01621459.2023.2169699 \endbibitem
- {barticle}[author] \bauthor\bsnmStone, \bfnmCharles J.\binitsC. J. (\byear1982). \btitleOptimal global rates of convergence for nonparametric regression. \bjournalThe Annals of Statistics \bvolume10 \bpages1040 – 1053. \bdoi10.1214/aos/1176345969 \endbibitem
- {barticle}[author] \bauthor\bsnmStone, \bfnmCharles J.\binitsC. J. (\byear1985). \btitleAdditive regression and other nonparametric models. \bjournalThe Annals of Statistics \bvolume13. \bdoi10.1214/aos/1176349548 \endbibitem
- {bbook}[author] \bauthor\bsnmTao, \bfnmTerence\binitsT. (\byear2022). \btitleAnalysis II, \beditionFourth ed. \bseriesTexts and Readings in Mathematics (TRIM) \bvolume38. \bpublisherSpringer Singapore. \bdoihttps://doi.org/10.1007/978-981-19-7284-3 \endbibitem
- {bbook}[author] \bauthor\bsnmTröltzsch, \bfnmFredi\binitsF. (\byear2010). \btitleOptimal control of partial differential equations: Theory, methods and applications. \bseriesGraduate Studies in Mathematics \bvolume112. \bpublisherAmerican Mathematical Society (AMS). \endbibitem
- {bbook}[author] \bauthor\bsnmTsybakov, \bfnmAlexandre B.\binitsA. B. (\byear2009). \btitleIntroduction to Nonparametric Estimation. \bseriesSpringer Series in Statistics (SSS). \bpublisherSpringer New York, NY. \bdoihttps://doi.org/10.1007/b13794 \endbibitem
- {bbook}[author] \bauthor\bsnmVaart, \bfnmAad W. van der\binitsA. W. v. d. and \bauthor\bsnmWellner, \bfnmJon A.\binitsJ. A. (\byear2023). \btitleWeak Convergence and Empirical Processes: With Applications to Statistics, \beditionSecond ed. \bseriesSpringer Series in Statistics (SSS). \bpublisherSpringer Cham. \bdoihttps://doi.org/10.1007/978-3-031-29040-4 \endbibitem
- {barticle}[author] \bauthor\bsnmVan Engelen, \bfnmJesper E\binitsJ. E. and \bauthor\bsnmHoos, \bfnmHolger H\binitsH. H. (\byear2020). \btitleA survey on semi-supervised learning. \bjournalMachine learning \bvolume109 \bpages373–440. \bdoihttps://doi.org/10.1007/s10994-019-05855-6 \endbibitem
- {bbook}[author] \bauthor\bsnmWahba, \bfnmGrace\binitsG. (\byear1990). \btitleSpline Models for Observational Data. \bseriesCBMS-NSF Regional Conference Series in Applied Mathematics. \bpublisherSociety for Industrial and Applied Mathematics (SIAM). \bdoi10.1137/1.9781611970128 \endbibitem
- {bbook}[author] \bauthor\bsnmWainwright, \bfnmMartin J\binitsM. J. (\byear2019). \btitleHigh-dimensional statistics: A non-asymptotic viewpoint \bvolume48. \bpublisherCambridge university press. \endbibitem
- {bbook}[author] \bauthor\bsnmWasserman, \bfnmLarry\binitsL. (\byear2006). \btitleAll of nonparametric statistics, \beditionFirst ed. \bseriesSpringer Texts in Statistics. \bpublisherSpringer New York. \bdoihttps://doi.org/10.1007/0-387-30623-4 \endbibitem
- {barticle}[author] \bauthor\bsnmX. Liu, \bfnmZ. Zhai\binitsZ. Z. (\byear2007). \btitleInverse modeling methods for indoor airborne pollutant tracking:literature review and fundamentals. \bjournalIndoor Air \bvolume17 \bpages419-438. \bdoihttps://doi.org/10.1111/j.1600-0668.2007.00497.x \endbibitem
- {barticle}[author] \bauthor\bsnmYang, \bfnmYuhong\binitsY. and \bauthor\bsnmBarron, \bfnmAndrew\binitsA. (\byear1999). \btitleInformation-theoretic determination of minimax rates of convergence. \bjournalThe Annals of Statistics \bvolume27 \bpages1564 – 1599. \bdoi10.1214/aos/1017939142 \endbibitem
- {binproceedings}[author] \bauthor\bsnmYarotsky, \bfnmDmitry\binitsD. (\byear2018). \btitleOptimal Approximation of Continuous Functions by Very Deep ReLU Networks. In \bbooktitleConference on learning theory \bpages639–649. \bpublisherPMLR. \endbibitem
- {barticle}[author] \bauthor\bsnmYarotsky, \bfnmDmitry\binitsD. and \bauthor\bsnmZhevnerchuk, \bfnmAnton\binitsA. (\byear2020). \btitleThe Phase Diagram of Approximation Rates for Deep Neural Networks. \bjournalAdvances in neural information processing systems \bvolume33 \bpages13005–13015. \endbibitem
- {binproceedings}[author] \bauthor\bsnmZhang, \bfnmTong\binitsT. (\byear2000). \btitleThe value of unlabeled data for classification problems. In \bbooktitleProceedings of the 17th International Conference on Machine Learning. \endbibitem
- {barticle}[author] \bauthor\bsnmZhou, \bfnmShanggang\binitsS. and \bauthor\bsnmWolfe, \bfnmDouglas A.\binitsD. A. (\byear2000). \btitleOn derivative estimation in spline regression. \bjournalStatistica Sinica. \endbibitem
- {bbook}[author] \bauthor\bsnmZhu, \bfnmXiaojin\binitsX. and \bauthor\bsnmGoldberg, \bfnmAndrew B.\binitsA. B. (\byear2009). \btitleIntroduction to semi-supervised learning, \beditionFirst ed. \bseriesSynthesis Lectures on Artificial Intelligence and Machine Learning (SLAIML). \bpublisherSpringer Cham. \bdoihttps://doi.org/10.1007/978-3-031-01548-9 \endbibitem
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.