{"main_topic":"education","sub_topic":"gosa","topic":"sat_scores_recent","key":"education/gosa/sat_scores_recent","version":"1.0.0","year_min":2004,"year_max":2024,"year_gaps":[],"update_frequency":"annual","detail_levels":["districts","schools","states"],"default_detail":"schools","path_template":"education/sat_scores_recent/year={year}/{detail}.parquet","grain":["year","district_code","school_code","demographic","test_component"],"title":"SAT Scores (Most Recent Administration)","summary":"Average SAT scores by section for Georgia public high schools, using each student's most recent test date, 2004-2024.","description":"Average SAT scores (most recent administration per student) and number of students tested for Georgia public high schools, with official district and state rollups, by SAT test component. 2004-2010 additionally break out demographic subgroups; 2011 onward is all-students only. Spans the 2016 SAT redesign: pre-redesign components (verbal_math, reading, mathematics, writing, combined, verbal_math_writing) and redesigned components (combined_test_score, math_section_score, evidence_based_reading_and_writing, reading_test_score, writlang_test_score, essay components) are distinct categorical values because they measure on different scales. Best-attempt scores live in the sibling topic sat_scores_highest.","source":"GOSA","source_url":"https://gosa.georgia.gov/dashboards-data-report-card/downloadable-data","endpoint":"/api/v1/education/sat_scores_recent","columns":[{"name":"year","type":"integer","role":"year","nullable":false,"label":"Year","short_description":"","description":"Reporting year. For 2011-2024 this is the spring (ending) calendar year of the school year in the source's LONG_SCHOOL_YEAR; for 2004-2010 the source carries no year column and the filename publication year is used.","valid_values":null,"example":2024,"unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"district_code","type":"string","role":"fk_district","nullable":true,"label":"District Code","short_description":"","description":"GOSA district code (FK to districts dimension): 3-digit zero-padded county/city codes or 7-digit charter codes. NULL on state-level rows.","valid_values":null,"example":"601","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"school_code","type":"string","role":"fk_school","nullable":true,"label":"School Code","short_description":"","description":"GOSA school code, zero-padded to 4 characters (composite FK to schools dimension with district_code; not globally unique on its own). NULL on district- and state-level rows.","valid_values":null,"example":"0103","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"demographic","type":"string","role":"fk_demographic","nullable":false,"label":"Demographic","short_description":"Student race or gender subgroup; breakdowns exist only for 2004-2010, every later row is 'all'.","description":"Demographic subgroup (FK to the global demographics dimension). Subgroup breakdowns exist only for 2004-2010; from 2011 onward the source publishes all-students rows only, so every 2011+ row is 'all'. The asian_pacific_islander key reflects the source's pre-1997 OMB combined Asian + Pacific Islander bucket (the bronze label is bare 'Asian', but no era publishes a separate Pacific Islander row and sibling GOSA reports use the explicit combined label) — it is NOT an Asian-only count, per data-cleaning-standards section 5b. race_unknown (no-response) carries data only in 2004; 2005-2007 publish the slot empty and 2008+ drop it.","valid_values":["all","asian_pacific_islander","black","female","hispanic","male","native_american","other","race_unknown","white"],"example":"all","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":true,"metric_component":null},{"name":"test_component","type":"string","role":"categorical","nullable":false,"label":"Test Component","short_description":"Which SAT section or composite the score is for; scales differ, so never compare across components.","description":"SAT test component. Pre-redesign (scales in parentheses): reading (200-800; published as 'Verbal' until 2011, the same Critical Reading section, 2004-2016), mathematics (200-800, 2004-2016), writing (200-800, 2008-2016), verbal_math (V+M composite, 400-1600, 2004-2010), verbal_math_writing (V+M+W composite, 600-2400, 2008-2010, all-students only), combined (V+M+W composite, 600-2400, 2011-2016). Redesigned SAT (2016 onward): combined_test_score (Math+EBRW, 400-1600), math_section_score (200-800), evidence_based_reading_and_writing (200-800, 2016-2019 only), reading_test_score and writlang_test_score (GOSA-rescaled test scores, observed 180-338), essay_reading_score / essay_analysis_score / essay_writing_score (2-8) and essay_total (6-24, a GOSA-derived sum of the three essay dimensions, not an official College Board score). Old and redesigned components are distinct values because they measure on different scales.","valid_values":["combined","combined_test_score","essay_analysis_score","essay_reading_score","essay_total","essay_writing_score","evidence_based_reading_and_writing","math_section_score","mathematics","reading","reading_test_score","verbal_math","verbal_math_writing","writing","writlang_test_score"],"example":"combined_test_score","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":true,"metric_component":null},{"name":"num_tested","type":"number","role":"metric","nullable":true,"label":"Number Tested","short_description":"","description":"Number of students tested. Float64, NOT Int64 — a documented exception to the integer-count convention (data-cleaning-standards section 16): 2016-2024 sources publish fractional counts on the combined_test_score component because GOSA reports it as the equal-weight mean of the three SAT section test-taker counts (Math, Reading, Writing & Language), not a headcount (e.g. (47+37+37)/3 -> 40.3 — exact thirds where published full-precision, rounded to one decimal from ~2020), and 2023-24 district counts are fractional throughout (e.g. 4159.3). Rounding to an integer would silently lose precision, so values are carried as published. 2004-2010 sources publish one count per (entity, demographic), repeated on each of that demographic's component rows; 2011-2024 publish true per-component counts. Either way, summing num_tested across test_component double-counts students — filter to one component for headcounts. A 0 means the source reported zero test-takers for that demographic (2004-2010 only).","valid_values":null,"example":156.0,"unit":"count","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (too few students; TFS marker in 2011-2024 sources).","key_metric_grain_contributor":false,"metric_component":"denominator"},{"name":"avg_score","type":"number","role":"metric","nullable":true,"label":"Average Score","short_description":"Average most-recent-attempt SAT score; on the scale of the section named by test_component.","description":"Average SAT score. The scale varies by test_component (sections 200-800; verbal_math / combined_test_score 400-1600; combined / verbal_math_writing 600-2400; essay dimensions 2-8; essay_total 6-24), so this column carries no single value_min/value_max — the avg_score_within_component_scale quality check enforces the per-component ranges instead. reading_test_score and writlang_test_score are excluded from that check: GOSA publishes them on a rescaled axis (observed 180-338, consistent with 10x the College Board 10-40 test-score scale) with no clean published ceiling. Known source defects NULLed by the transform per data-cleaning-standards section 4b (rows and num_tested preserved): seven values above their component ceilings — the 2009 Carroll County district verbal_math_writing of 2751 (> 2400), and in 2010 Rockdale County High School (722:3052) verbal_math_writing=3114, verbal_math=2133, reading=1039, mathematics=1094, writing=981 plus Elberta Open Campus (676:3050) reading=819 (> 800) — and nine 2011-2015 writing averages below the 200 section floor (as low as 92, all small schools/districts). This revises the preserve-bronze default for those values; the per-component range check remains enforceable because of it. Extreme-but-conceivable values are preserved: Elberta Open Campus's 2010 verbal_math_writing of 2287 is within the 600-2400 scale and is carried as published (suspect — the same row's reading section is impossible — but not provably wrong itself).","valid_values":null,"example":982.3,"unit":"score","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (too few test-takers), a zero-test-taker demographic cell (2004-2010), or one of 16 impossible source values NULLed by the transform.","key_metric_grain_contributor":false,"metric_component":null}],"key_metric":"avg_score","usage":"Star-schema fact table. Join the districts dimension on district_code; the schools dimension on district_code + school_code; the demographics dimension on demographic. Read directly with DuckDB over Parquet.","limitations":"Suppressed cells are NULL (not zero). State rows have NULL district_code and school_code; district rows have NULL school_code. Scores are NOT comparable across the 2016 SAT redesign boundary — use the distinct pre-redesign vs redesigned test_component values, never mix them in one time series. Demographic subgroups exist only for 2004-2010. num_tested can be fractional (see column description). reading_test_score / writlang_test_score are on a GOSA-rescaled axis, not the College Board 10-40 scale and not the 200-800 section scale.","null_semantics":{"suppressed_to_null":true,"zero_is_real":true},"available_years":[2004,2005,2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018,2019,2020,2021,2022,2023,2024],"schema_hash":"675005c92aa47a74bca9b4271362bb5d6105914ed5d308994fe51dd31d69efba","tags":["education","gosa","sat_scores_recent"],"granularity_description":"One row per year, district_code, school_code, demographic, test_component (geography columns are NULL at higher aggregation levels).","example_queries":[{"description":"Latest year (2024), schools detail","query":"SELECT * FROM sat_scores_recent WHERE year = 2024 LIMIT 100"},{"description":"District 601 in 2024","query":"SELECT * FROM sat_scores_recent WHERE district_code = '601' AND year = 2024 LIMIT 100"},{"description":"Filter by test_component = combined_test_score","query":"SELECT * FROM sat_scores_recent WHERE test_component = 'combined_test_score' AND year = 2024 LIMIT 100"}],"foreign_keys":[{"column":"district_code","target_object":"districts","target_columns":["district_code"],"attribute_columns_added":["district_name","district_census_id","district_type"],"scope":"domain"},{"column":"school_code","target_object":"schools","target_columns":["district_code","school_code"],"attribute_columns_added":["school_name"],"scope":"domain"},{"column":"demographic","target_object":"demographics","target_columns":["demographic"],"attribute_columns_added":["demographic_label","demographic_category"],"scope":"global"}],"filters":[{"param":"year","kind":"year_exact","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"year_min","kind":"year_range","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"year_max","kind":"year_range","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"detail","kind":"detail","type":"string","multi_value":false,"enum_enforced":true,"values":["districts","schools","states"],"default":"schools"},{"param":"district_code","kind":"foreign_key","column":"district_code","type":"string","multi_value":true,"enum_enforced":false,"values":null},{"param":"school_code","kind":"foreign_key","column":"school_code","type":"string","multi_value":true,"enum_enforced":false,"values":null},{"param":"demographic","kind":"foreign_key","column":"demographic","type":"string","multi_value":true,"enum_enforced":true,"values":["all","asian_pacific_islander","black","female","hispanic","male","native_american","other","race_unknown","white"]},{"param":"test_component","kind":"categorical","column":"test_component","type":"string","multi_value":true,"enum_enforced":true,"values":["combined","combined_test_score","essay_analysis_score","essay_reading_score","essay_total","essay_writing_score","evidence_based_reading_and_writing","math_section_score","mathematics","reading","reading_test_score","verbal_math","verbal_math_writing","writing","writlang_test_score"],"has_total":true,"total_value":"combined","requires_single_value":false},{"param":"district_type","kind":"dimension_attribute","dimension":"districts","column":"district_type","type":"string","multi_value":true,"enum_enforced":true,"values":["commission_charter","resa","standard","state_agency","state_charter","state_school","state_special"]},{"param":"demographic_category","kind":"dimension_attribute","dimension":"demographics","column":"demographic_category","type":"string","multi_value":true,"enum_enforced":true,"values":["aggregate","disability","economic_status","esol","foster_care","gender","grade","homeless_status","migrant_status","military","race","sped"]}],"filter_hints":[]}