{"main_topic":"education","sub_topic":"gosa","topic":"attendance","key":"education/gosa/attendance","version":"1.0.0","year_min":2004,"year_max":2024,"year_gaps":[],"update_frequency":"annual","detail_levels":["districts","schools","states"],"default_detail":"schools","path_template":"education/attendance/year={year}/{detail}.parquet","grain":["year","district_code","school_code","demographic"],"title":"Student Attendance and Chronic Absenteeism","summary":"Student attendance tiers and chronic-absence rates by Georgia school, district, and demographic subgroup, 2004-2024.","description":"Distribution of Georgia public-school students across three absentee tiers (5 or fewer, 6-15, and more than 15 days absent during the school year), plus a chronic-absence rate (10% or more of enrolled days; published from 2018) and the student count in the denominator, for every school with official district and state rollups, by demographic subgroup (race/ethnicity, gender, economic status, English proficiency, migrant status, disability status). Published by GOSA for school years 2003-04 through 2023-24.","source":"GOSA","source_url":"https://gosa.georgia.gov/dashboards-data-report-card/downloadable-data","endpoint":"/api/v1/education/attendance","columns":[{"name":"year","type":"integer","role":"year","nullable":false,"label":"Year","short_description":"","description":"Ending (spring) calendar year of the school year (e.g. 2024 for 2023-24). 
2011-2024 files carry a LONG_SCHOOL_YEAR cross-checked against the filename; 2004-2010 files carry no year column, so the filename year (verified as the ending year via year-over-year enrollment continuity) is authoritative.","valid_values":null,"example":2024,"unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"district_code","type":"string","role":"fk_district","nullable":true,"label":"District Code","short_description":"","description":"GOSA district code (FK to districts dimension): 3-digit zero-padded county/city codes or 7-digit state-charter codes (present from 2010). NULL on state-level rows.","valid_values":null,"example":"601","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"school_code","type":"string","role":"fk_school","nullable":true,"label":"School Code","short_description":"","description":"GOSA school code, zero-padded to 4 characters (composite FK to schools dimension with district_code; not globally unique on its own). 2004-2010 sources publish unpadded codes ('103'), zero-padded here to align with 2011-2024. NULL on district- and state-level rows.","valid_values":null,"example":"0103","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"demographic","type":"string","role":"fk_demographic","nullable":false,"label":"Demographic","short_description":"Student subgroup the row describes (race, gender, economic status, English learner, migrant, or disability); 'all' is every student.","description":"Demographic subgroup (FK to the global demographics dimension); 15 subgroups in every year. 
Race buckets use the combined asian_pacific_islander key (pre-1997 OMB convention): the source publishes six race buckets with a bare 'Asian' label and never a separate Pacific Islander row, and the six buckets' state-level student counts sum exactly to the 'all' total in every year except 2013 (where they fall short by exactly 8 of 1.84M students, an unallocated-records artifact rather than a dropped Pacific Islander population). 'all' is the unfiltered total and overlaps every other value; subgroups are mutually exclusive only within their own category (race, gender, economic, English proficiency, disability, migrant).","valid_values":["all","asian_pacific_islander","black","economically_disadvantaged","english_learners","female","hispanic","male","migrant","multiracial","native_american","not_economically_disadvantaged","students_with_disabilities","students_without_disabilities","white"],"example":"all","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":true,"metric_component":null},{"name":"num_students","type":"integer","role":"metric","nullable":true,"label":"Number of Students","short_description":"","description":"Number of students in the subgroup, used as the denominator for the rate metrics. A real 0 means the subgroup has no students at that entity. §4b mask: in 2023 (5 rows) and 2024 (6 rows) the source published STUDENT_COUNT_FEMALE = 0 alongside non-zero published rates at tiny special-population schools (e.g. 
East DeKalb Special Education Center 2023: all=80, male=71, female reads 0) — a zero population cannot carry non-zero rates, so those counts are NULLed and the rates kept.","valid_values":null,"example":954,"unit":"count","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (TFS literals 2021-2024, blank cells 2023-2024), or masked: 11 rows in 2023-2024 published an impossible count of 0 alongside non-zero rates.","key_metric_grain_contributor":false,"metric_component":"denominator"},{"name":"five_or_fewer_days_absent_rate","type":"number","role":"metric","nullable":true,"label":"Five or Fewer Days Absent Rate","short_description":"","description":"Proportion of the subgroup absent five or fewer days during the school year (0-1 scale; source publishes 0-100, divided by 100). Together with the 6-15 and over-15 tiers it partitions the subgroup population (sums to ~1.0). §4b mask: in 2005-2020 the source publishes 0 for all rate metrics when the subgroup has zero students; those placeholder zeros are NULLed (rates of an empty population are undefined; 2004 already publishes NULL there).","valid_values":null,"example":0.452,"unit":"proportion","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (blank cells in 2004, TFS/blank from 2021), or a zero-population placeholder masked to NULL (2005-2020).","key_metric_grain_contributor":false,"metric_component":null},{"name":"six_to_fifteen_days_absent_rate","type":"number","role":"metric","nullable":true,"label":"Six to Fifteen Days Absent Rate","short_description":"","description":"Proportion of the subgroup absent six to fifteen days during the school year (0-1 scale; source publishes 0-100, divided by 100). Middle tier of the three-tier partition. 
Zero-population placeholder zeros NULLed in 2005-2020 (see five_or_fewer_days_absent_rate).","valid_values":null,"example":0.381,"unit":"proportion","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (blank cells in 2004, TFS/blank from 2021), or a zero-population placeholder masked to NULL (2005-2020).","key_metric_grain_contributor":false,"metric_component":null},{"name":"over_15_days_absent_rate","type":"number","role":"metric","nullable":true,"label":"Over 15 Days Absent Rate","short_description":"","description":"Proportion of the subgroup absent more than fifteen days during the school year (0-1 scale; source publishes 0-100, divided by 100). Top tier of the three-tier partition; often used as a chronic-absenteeism proxy but distinct from chronically_absent_rate (different cutoff definitions). Zero-population placeholder zeros NULLed in 2005-2020 (see five_or_fewer_days_absent_rate).","valid_values":null,"example":0.168,"unit":"proportion","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (blank cells in 2004, TFS/blank from 2021), or a zero-population placeholder masked to NULL (2005-2020).","key_metric_grain_contributor":false,"metric_component":null},{"name":"chronically_absent_rate","type":"number","role":"metric","nullable":true,"label":"Chronically Absent Rate","short_description":"Share of the subgroup chronically absent (missing 10% or more of enrolled days), on a 0-1 scale; published from 2018.","description":"Proportion of the subgroup chronically absent — absent 10% or more of enrolled days, the federal definition (0-1 scale; source publishes 0-100, divided by 100). Published 2018-2024 only: the column does not exist in the 2004-2010 and 2012-2017 sources, and the 2011 column's only non-null values are zero-population placeholder zeros (masked to NULL), so 2004-2017 is entirely NULL. 
NOT equal to over_15_days_absent_rate — the cutoffs differ (10% of enrolled days vs a fixed 15-day count).","valid_values":null,"example":0.121,"unit":"proportion","value_min":null,"value_max":null,"null_meaning":"Not published before 2018 (no column 2004-2010 and 2012-2017; 2011's column holds only zero-population placeholders), suppressed (TFS/blank 2021-2024), or a zero-population placeholder masked to NULL (2018-2020).","key_metric_grain_contributor":false,"metric_component":null}],"key_metric":"chronically_absent_rate","usage":"Star-schema fact table. Join the districts dimension on district_code; the schools dimension on district_code + school_code; the demographics dimension on demographic. Read directly with DuckDB over Parquet.","limitations":"Suppressed cells are NULL (not zero): 2004 publishes blank cells for many subgroups (including some state-level rows); 2021-2022 use the TFS (too few students) literal; 2023-2024 mix TFS and blank cells per cell, so the tier metrics are not co-suppressed and small-population subgroups (notably native_american) are mostly NULL in 2021-2024. chronically_absent_rate is NULL for every year before 2018. Rate metrics are NULL (masked) where a subgroup has zero students. State rows have NULL district_code and school_code; district rows have NULL school_code. 
The race axis uses the combined asian_pacific_islander bucket, so it is not comparable row-for-row with split-convention topics without aggregating those topics' asian + pacific_islander rows at query time.","null_semantics":{"suppressed_to_null":true,"zero_is_real":true},"available_years":[2004,2005,2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018,2019,2020,2021,2022,2023,2024],"schema_hash":"9033e13d86bf30d8958b6deedc65f10eebc3bb5e4ce4547215e5726aa86ecd95","tags":["education","gosa","attendance"],"granularity_description":"One row per year, district_code, school_code, demographic (geography columns are NULL at higher aggregation levels).","example_queries":[{"description":"Latest year (2024), schools detail","query":"SELECT * FROM attendance WHERE year = 2024 LIMIT 100"},{"description":"District 601 in 2024","query":"SELECT * FROM attendance WHERE district_code = '601' AND year = 2024 LIMIT 100"}],"foreign_keys":[{"column":"district_code","target_object":"districts","target_columns":["district_code"],"attribute_columns_added":["district_name","district_census_id","district_type"],"scope":"domain"},{"column":"school_code","target_object":"schools","target_columns":["district_code","school_code"],"attribute_columns_added":["school_name"],"scope":"domain"},{"column":"demographic","target_object":"demographics","target_columns":["demographic"],"attribute_columns_added":["demographic_label","demographic_category"],"scope":"global"}],"filters":[{"param":"year","kind":"year_exact","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"year_min","kind":"year_range","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"year_max","kind":"year_range","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"detail","kind":"detail","type":"string","multi_value":false,"enum_enforced":true,"values":["districts","schools","states"],"default":"schools"},{"param":"district_code","kind":"foreign_key","column":"district_code","type":"string","multi_value":true,"enum_enforced":false,"values":null},{"param":"school_code","kind":"foreign_key","column":"school_code","type":"string","multi_value":true,"enum_enforced":false,"values":null},{"param":"demographic","kind":"foreign_key","column":"demographic","type":"string","multi_value":true,"enum_enforced":true,"values":["all","asian_pacific_islander","black","economically_disadvantaged","english_learners","female","hispanic","male","migrant","multiracial","native_american","not_economically_disadvantaged","students_with_disabilities","students_without_disabilities","white"]},{"param":"district_type","kind":"dimension_attribute","dimension":"districts","column":"district_type","type":"string","multi_value":true,"enum_enforced":true,"values":["commission_charter","resa","standard","state_agency","state_charter","state_school","state_special"]},{"param":"demographic_category","kind":"dimension_attribute","dimension":"demographics","column":"demographic_category","type":"string","multi_value":true,"enum_enforced":true,"values":["aggregate","disability","economic_status","esol","foster_care","gender","grade","homeless_status","migrant_status","military","race","sped"]}],"filter_hints":[]}