{"main_topic":"education","sub_topic":"gosa","topic":"sat_scores_highest","key":"education/gosa/sat_scores_highest","version":"1.0.0","year_min":2004,"year_max":2024,"year_gaps":[],"update_frequency":"annual","detail_levels":["districts","schools","states"],"default_detail":"schools","path_template":"education/sat_scores_highest/year={year}/{detail}.parquet","grain":["year","district_code","school_code","demographic","test_component"],"title":"SAT Scores (Highest Section Score)","summary":"Average SAT scores by section for Georgia public high schools, using each student's highest score across test dates, 2004-2024.","description":"Average SAT scores by test section for Georgia public high schools, with official district and state rollups, using each student's HIGHEST section score across SAT administrations (the companion sat_scores_recent topic reports the most-recent administration instead). Spans the 2004-2024 GOSA publications across the old SAT (Verbal/Math, later +Writing) and the 2016 redesigned SAT (Evidence-Based Reading and Writing, Math, optional Essay).","source":"GOSA","source_url":"https://gosa.georgia.gov/dashboards-data-report-card/downloadable-data","endpoint":"/api/v1/education/sat_scores_highest","columns":[{"name":"year","type":"integer","role":"year","nullable":false,"label":"Year","short_description":"","description":"Reporting year. For 2011-2024 this is the spring (ending) calendar year of the school year in the source's LONG_SCHOOL_YEAR; for 2004-2010 the source carries no year column and the filename publication year is used.","valid_values":null,"example":2024,"unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"district_code","type":"string","role":"fk_district","nullable":true,"label":"District Code","short_description":"","description":"GOSA district code (FK to districts dimension): 3-digit zero-padded county/city codes or 7-digit state-charter codes. NULL on state-level rows.","valid_values":null,"example":"601","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"school_code","type":"string","role":"fk_school","nullable":true,"label":"School Code","short_description":"","description":"GOSA school code, zero-padded to 4 characters (composite FK to schools dimension with district_code; not globally unique on its own). NULL on district- and state-level rows.","valid_values":null,"example":"0103","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":false,"metric_component":null},{"name":"demographic","type":"string","role":"fk_demographic","nullable":false,"label":"Demographic","short_description":"Student race or gender subgroup; breakdowns exist only for 2004-2010, every later row is 'all'.","description":"Student demographic (FK to demographics dimension). Breakdowns exist only for 2004-2010: race buckets (asian_pacific_islander, black, hispanic, native_american, white, other, race_unknown) and gender (female, male); every 2011-2024 row is 'all'. The source's bare 'Asian' label is the pre-1997 OMB COMBINED Asian/Pacific Islander bucket (no era of this source publishes a separate Pacific Islander row), so it maps to asian_pacific_islander — race buckets are mutually exclusive within this topic. 2007-2010 subgroup rows carry counts only (avg_score is NULL); race_unknown ('No Response') carries data in 2004 only (the source's R columns exist through 2007 but are blank in 2005-2006 and literal-'NULL' in 2007).","valid_values":["all","asian_pacific_islander","black","female","hispanic","male","native_american","other","race_unknown","white"],"example":"all","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":true,"metric_component":null},{"name":"test_component","type":"string","role":"categorical","nullable":false,"label":"Test Component","short_description":"Which SAT section or composite the score is for; scales differ, so never compare across components.","description":"SAT section/composite, era-aware: 2004-2010 wide files publish reading (source 'High Verbal'), mathematics ('High Math'), verbal_math (their 400-1600 two-section total, equal to reading + mathematics) and writing (2008-2010); 2011-2016 old-SAT files publish reading, mathematics, writing and combined (the 600-2400 three-section total); 2016-2024 redesigned-SAT files publish combined_test_score (400-1600), math_section_score (200-800), reading_test_score / writlang_test_score (GOSA-rescaled 10-40 section tests, observed ~190-340), evidence_based_reading_and_writing (200-800, 2016-2019 only), and the essay components (essay_reading_score / essay_analysis_score / essay_writing_score on 2-8, essay_total on 6-24). Old-SAT and redesigned-SAT composites are NOT comparable across 2016.","valid_values":["combined","combined_test_score","essay_analysis_score","essay_reading_score","essay_total","essay_writing_score","evidence_based_reading_and_writing","math_section_score","mathematics","reading","reading_test_score","verbal_math","writing","writlang_test_score"],"example":"combined_test_score","unit":null,"value_min":null,"value_max":null,"null_meaning":null,"key_metric_grain_contributor":true,"metric_component":null},{"name":"num_tested","type":"number","role":"metric","nullable":true,"label":"Number Tested","short_description":"","description":"Number of students tested. Float64 by documented exception (data-cleaning-standards §16): 2016-2024 bronze publishes FRACTIONAL counts on the combined_test_score component because GOSA reports it as the equal-weight mean of the three SAT section test-taker counts (Math, Reading, Writing & Language), not a headcount (e.g. (47+37+37)/3 -> 40.3 — exact thirds in 2016-2019, rounded to one decimal from ~2020); rounding to an integer would lose precision, and every other component's count is integral (enforced by a quality check). 2004-2010 sources publish one count per (entity, demographic) administration, repeated on each of that demographic's component rows; 2011-2024 sources publish true per-component counts. Either way, summing num_tested across test_component double-counts students — filter to one component for headcounts. A count of 0 is a real observation (zero students in that demographic took the SAT).","valid_values":null,"example":104,"unit":"count","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (TFS marker, 2011-2024) or not published for that rollup.","key_metric_grain_contributor":false,"metric_component":"denominator"},{"name":"avg_score","type":"number","role":"metric","nullable":true,"label":"Average Score","short_description":"Average highest-attempt SAT score; on the scale of the section named by test_component.","description":"Average SAT score on the component's own scale (see test_component; scales range from essay 2-8 up to combined 600-2400, so never aggregate avg_score across components). Known source defects NULLed per data-cleaning-standards §4b (rows and num_tested preserved; 20 values): (a) 2009 Heritage High School (722:176) published reading=1022, mathematics=995, writing=976, verbal_math=2017 and South Atlanta Leadership (761:308) writing=811 — all above the 200-800 / 400-1600 scales; (b) 2010 Rockdale County High (722:3052) published reading=1055, mathematics=1113, writing=1009, verbal_math=2168 and Elberta Open Campus (676:3050) reading=822; (c) 2011-2015 published 10 old-SAT writing averages below the 200 section floor (7 school + 3 district rollup values, as low as 112). An average cannot leave the per-student scale, so these are publication errors; NULLing them revises the preserve-bronze default for this column and keeps the per-component range quality check enforceable. Preserved-but-flagged instead: 2010 Elberta verbal_math=1522 (in range, but its masked reading component makes it suspect) and the systematically low 2011-2016 writing averages (state 361-406 vs ~470 on the recent-SAT basis) — see limitations.","valid_values":null,"example":1043.6,"unit":"score","value_min":null,"value_max":null,"null_meaning":"Suppressed by GOSA (too few test-takers), a count-only observation (all 2007-2010 demographic subgroup rows), or one of 20 impossible source values NULLed by the transform (2009-2015).","key_metric_grain_contributor":false,"metric_component":null}],"key_metric":"avg_score","usage":"Star-schema fact table. Join the districts dimension on district_code; the schools dimension on district_code + school_code; the demographics dimension on demographic. Read directly with DuckDB over Parquet.","limitations":"Suppressed cells are NULL (not zero). State rows have NULL district_code and school_code; district rows have NULL school_code. avg_score scales differ by test_component — never compare or aggregate across components, and never sum num_tested across components (it double-counts students). Old-SAT (pre-2016) and redesigned-SAT (2016+) results are not comparable. HIGHEST-basis caveat for the old-SAT writing component (2011-2016): GOSA's published averages are systematically far below the recent-basis values (state 361-406 vs ~470) and 10 values fall below the section's 200 floor (NULLed); in-range writing values — and the combined composite that embeds them — appear deflated on the highest basis and should be used with caution. The 2010 Elberta Open Campus verbal_math value (1522) is preserved although its reading component (822) was impossible and NULLed.","null_semantics":{"suppressed_to_null":true,"zero_is_real":true},"available_years":[2004,2005,2006,2007,2008,2009,2010,2011,2012,2013,2014,2015,2016,2017,2018,2019,2020,2021,2022,2023,2024],"schema_hash":"54310a68e15a2192a2cc29d30867e593e87667bc641b53c7e54127633abfa3e7","tags":["education","gosa","sat_scores_highest"],"granularity_description":"One row per year, district_code, school_code, demographic, test_component (geography columns are NULL at higher aggregation levels).","example_queries":[{"description":"Latest year (2024), schools detail","query":"SELECT * FROM sat_scores_highest WHERE year = 2024 LIMIT 100"},{"description":"District 601 in 2024","query":"SELECT * FROM sat_scores_highest WHERE district_code = '601' AND year = 2024 LIMIT 100"},{"description":"Filter by test_component = combined_test_score","query":"SELECT * FROM sat_scores_highest WHERE test_component = 'combined_test_score' AND year = 2024 LIMIT 100"}],"foreign_keys":[{"column":"district_code","target_object":"districts","target_columns":["district_code"],"attribute_columns_added":["district_name","district_census_id","district_type"],"scope":"domain"},{"column":"school_code","target_object":"schools","target_columns":["district_code","school_code"],"attribute_columns_added":["school_name"],"scope":"domain"},{"column":"demographic","target_object":"demographics","target_columns":["demographic"],"attribute_columns_added":["demographic_label","demographic_category"],"scope":"global"}],"filters":[{"param":"year","kind":"year_exact","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"year_min","kind":"year_range","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"year_max","kind":"year_range","type":"integer","multi_value":false,"enum_enforced":false,"values":null},{"param":"detail","kind":"detail","type":"string","multi_value":false,"enum_enforced":true,"values":["districts","schools","states"],"default":"schools"},{"param":"district_code","kind":"foreign_key","column":"district_code","type":"string","multi_value":true,"enum_enforced":false,"values":null},{"param":"school_code","kind":"foreign_key","column":"school_code","type":"string","multi_value":true,"enum_enforced":false,"values":null},{"param":"demographic","kind":"foreign_key","column":"demographic","type":"string","multi_value":true,"enum_enforced":true,"values":["all","asian_pacific_islander","black","female","hispanic","male","native_american","other","race_unknown","white"]},{"param":"test_component","kind":"categorical","column":"test_component","type":"string","multi_value":true,"enum_enforced":true,"values":["combined","combined_test_score","essay_analysis_score","essay_reading_score","essay_total","essay_writing_score","evidence_based_reading_and_writing","math_section_score","mathematics","reading","reading_test_score","verbal_math","writing","writlang_test_score"],"has_total":true,"total_value":"combined","requires_single_value":false},{"param":"district_type","kind":"dimension_attribute","dimension":"districts","column":"district_type","type":"string","multi_value":true,"enum_enforced":true,"values":["commission_charter","resa","standard","state_agency","state_charter","state_school","state_special"]},{"param":"demographic_category","kind":"dimension_attribute","dimension":"demographics","column":"demographic_category","type":"string","multi_value":true,"enum_enforced":true,"values":["aggregate","disability","economic_status","esol","foster_care","gender","grade","homeless_status","migrant_status","military","race","sped"]}],"filter_hints":[]}