This paper reviews empirical validation studies of the application of multizone indoor air quality (IAQ) models to residential-scale buildings. This review focuses on empirical verification efforts, although models have also been subjected to analytical verification and inter-model comparisons. In most reports, experimental data were compared to predictions of only one model - typically, either the CONTAM or COMIS models. However, inter-model comparisons have demonstrated consistency between these and other multizone models so most comparisons can be generalized to all multizone models. Few of the empirical verifications reported statistical analyses of the comparison between measurements and predictions. Where sufficient data were available in the literature, additional statistical analyses have been performed and reported. Also, most published reports did not address the issue of measurement uncertainty. No single reported multizone IAQ model validation effort can be considered to be complete due to limitations in scope, inadequate detail describing experimental and/or modeling procedures, lack of rigorous statistical analysis, inclusion of only small ranges of airflows and concentrations, questions on independence of validation datasets, and other shortcomings. However, if one considers the body of published validation work, it may be concluded that a knowledgeable user can expect to make reasonable predictions of air change rates, interzonal flows, and contaminant concentrations for residential-scale buildings dominated by stack-driven or ventilation flows with inert pollutants. In contrast, more work is clearly needed for applications with high wind speeds, reactive contaminants, or specialized situations such as ambient Additionally, future model validation efforts will be more useful if more statistical analyses are performed and if more detail on both the measurements and modeling are reported.Units: SI