Personal Tax Return Summary

This data model extracts key summary information from a completed US Personal Income Tax Return (Form 1040 series) and any associated state returns included in the document set. It captures essential details like the tax year, filer information (names, SSNs, filing status), dependents, a breakdown of income sources (wages, interest, dividends, capital gains, business income), adjustments to income, deductions taken (standard or itemized), tax calculation components, and significant tax credits claimed. This provides a consolidated overview of the filer's tax situation for the specified year as reported on their return.

Field name | Type | Description
filing_status | string | The tax filing status checked on Form 1040. Possible values: single, married_filing_jointly, married_filing_separately, head_of_household, qualifying_widow_er.
state_returns_included | string | Comma-separated list of state abbreviations (e.g., CA, NY, NC) for which state tax returns are identified within the document set.
persons | array | List of individuals listed on the tax return (primary, spouse, dependents).
      name | string | Full name of the person.
      ssn | string | Social Security Number (often redacted to last 4 digits).
      person_role | string | Role of the person on the return. Possible values: primary, spouse, dependent.
income_and_adjustments | object | Summary of income sources, details, and adjustments (above-the-line deductions).
      wages_salaries_tips | number | Total wages, salaries, tips, etc. reported.
      taxable_interest | number | Taxable interest income reported.
      ordinary_dividends | number | Ordinary dividends income reported.
      qualified_dividends | number | Qualified dividends income reported.
      ira_distributions_taxable | number | Taxable amount of IRA distributions reported.
      pensions_annuities_taxable | number | Taxable amount of pensions and annuities reported.
      social_security_benefits_taxable | number | Taxable amount of Social Security benefits reported.
      short_term_net_capital_gain_loss | number | Net short-term capital gain or loss reported (from Schedule D).
      long_term_net_capital_gain_loss | number | Net long-term capital gain or loss reported (from Schedule D).
      capital_gain_distributions | number | Capital gain distributions reported.
      capital_loss_carryforward | number | Capital loss carryover to the next tax year (from Schedule D).
      unemployment_compensation | number | Unemployment compensation received.
      total_income | number | Total income reported before adjustments.
      deductible_part_of_se_tax | number | Deductible portion of self-employment tax (from Schedule 1).
      total_adjustments | number | Total adjustments to income (from Schedule 1).
      adjusted_gross_income_agi | number | Adjusted Gross Income (AGI) calculated on the return.
schedule_c_profit_loss | array | List of businesses reported on attached Schedule C forms.
      business_name | string | Name of the self-employed business (from Schedule C).
      net_profit_loss | number | Net profit or loss reported for the business (from Schedule C).
schedule_e_income_loss | object | Summary of supplemental income and loss from Schedule E.
      total_rental_real_estate_income_loss | number | Total income or loss from rental real estate and royalties (from Schedule E).
      total_partnership_s_corp_income_loss | number | Total income or loss from partnerships and S corporations (from Schedule E).
      total_schedule_e_income_loss | number | Total supplemental income or loss reported (from Schedule E).
deductions | object | Summary of deductions taken (Standard or Itemized details, QBI).
      deduction_method | string | Indicates if the Standard or Itemized deduction was taken. Possible values: standard, itemized, unknown.
      standard_deduction_amount | number | The amount of the standard deduction claimed, if applicable.
      itemized_medical_expenses_deducted | number | Medical and dental expenses deducted, if itemizing (from Schedule A).
      itemized_state_and_local_taxes_salt | number | State and local taxes (SALT) deducted, if itemizing (capped, from Schedule A).
      itemized_home_mortgage_interest | number | Home mortgage interest deducted, if itemizing (from Schedule A).
      itemized_investment_interest | number | Investment interest deducted, if itemizing (from Schedule A).
      itemized_gifts_to_charity | number | Gifts to charity deducted, if itemizing (from Schedule A).
      total_itemized_deductions | number | The total amount of itemized deductions claimed, if applicable (from Schedule A).
      qualified_business_income_qbi_deduction | number | Qualified Business Income (QBI) deduction claimed.
      total_deductions_taken | number | Total deductions claimed (Standard/Itemized + QBI).
tax_calculation | object | Summary of tax liability components.
      taxable_income | number | Taxable income after deductions.
      tax_ordinary_income | number | Tax calculated on ordinary income. May include tax on capital gains/dividends if not broken out separately.
      tax_long_term_capital_gains_qualified_dividends | number | Tax specifically calculated on long-term capital gains and qualified dividends, if shown separately on the return or worksheets.
      self_employment_tax_schedule_se | number | Total self-employment tax reported (from Schedule SE or Schedule 2).
      net_investment_income_tax_niit | number | Net Investment Income Tax (NIIT) reported (from Form 8960 or Schedule 2).
      additional_medicare_tax | number | Additional Medicare Tax reported (from Form 8959 or Schedule 2).
      total_tax | number | Total tax liability reported before payments and most credits.
tax_credits | object | Summary of tax credits claimed.
      child_tax_credit_other_dependents_credit | number | Total Child Tax Credit and/or Credit for Other Dependents claimed.
      dependent_care_credit | number | Credit for child and dependent care expenses claimed (from Form 2441 or Schedule 3).
      residential_clean_energy_credit | number | Credit for residential clean energy investments claimed (from Form 5695 or Schedule 3).
      total_credits_claimed | number | Total amount of key tax credits claimed (may include nonrefundable and refundable credits).
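
To make the nesting concrete, here is a minimal Python sketch of how a record in this shape might be represented. It covers only a subset of the fields (filing_status, state_returns_included, persons, and part of deductions). The class names (Person, Deductions, TaxReturnSummary), the use of None for unpopulated numbers, and all sample values are assumptions made for illustration; they are not part of the data model itself.

```python
import json
from dataclasses import dataclass, field, asdict
from typing import List, Optional


@dataclass
class Person:
    name: str
    ssn: str          # often redacted to the last 4 digits
    person_role: str  # primary, spouse, or dependent


@dataclass
class Deductions:
    deduction_method: str = "unknown"  # standard, itemized, or unknown
    # Assumption: unpopulated numeric fields are represented as None (JSON null).
    standard_deduction_amount: Optional[float] = None
    total_itemized_deductions: Optional[float] = None
    qualified_business_income_qbi_deduction: Optional[float] = None
    total_deductions_taken: Optional[float] = None


@dataclass
class TaxReturnSummary:
    filing_status: str
    state_returns_included: str = ""
    persons: List[Person] = field(default_factory=list)
    deductions: Deductions = field(default_factory=Deductions)


# Illustrative values only; not taken from any real return.
example = TaxReturnSummary(
    filing_status="married_filing_jointly",
    state_returns_included="CA,NY",
    persons=[
        Person("Jane Doe", "XXX-XX-1234", "primary"),
        Person("John Doe", "XXX-XX-5678", "spouse"),
    ],
    deductions=Deductions(
        deduction_method="standard",
        standard_deduction_amount=27700.0,
        total_deductions_taken=27700.0,
    ),
)

# asdict() recurses into nested dataclasses and lists, giving a plain dict
# whose keys match the field names in the table above.
record = asdict(example)
print(json.dumps(record, indent=2))
```

The remaining objects (income_and_adjustments, schedule_c_profit_loss, schedule_e_income_loss, tax_calculation, tax_credits) would follow the same pattern, with arrays modeled as lists and objects as nested classes.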

Field Notes

  • All monetary values are extracted directly as found on the tax forms and schedules; no calculations are performed.
  • filing_status: Extracted from the checkboxes on the first page of Form 1040. Allowed values: single, married_filing_jointly, married_filing_separately, head_of_household, qualifying_widow_er.
  • state_returns_included: A comma-separated string of standard two-letter state abbreviations (e.g., "CA,NY,NC") based on identified state return forms within the document set.
  • persons.person_role: Indicates the role of the individual on the return. Allowed values: primary, spouse, dependent.
  • deductions.deduction_method: Determined based on whether the standard deduction amount is used or if itemized deductions from Schedule A are claimed. Allowed values: standard, itemized, unknown.
  • If itemized deductions are taken (deduction_method = itemized), the specific amounts for medical, SALT, mortgage interest, investment interest, and charity are extracted from Schedule A, along with the total itemized amount.
  • If the standard deduction is taken (deduction_method = standard), the standard_deduction_amount field is populated, and itemized fields will typically be null or zero.
  • Capital gain/loss details (short_term_net_capital_gain_loss, long_term_net_capital_gain_loss, capital_loss_carryforward) are sourced from Schedule D.
  • Schedule C and Schedule E details are extracted if those schedules are present in the document set.
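
The allowed values and the standard/itemized rules in the notes above can be checked mechanically. The following is a rough Python sketch of such a check, assuming the extracted record is available as a plain dictionary keyed by the field names in the table. The helper names parse_states and check_record are hypothetical, and because the notes say itemized fields are only "typically" null or zero under the standard deduction, the findings are best read as warnings, not hard errors.

```python
from typing import Any, Dict, List

FILING_STATUSES = {
    "single", "married_filing_jointly", "married_filing_separately",
    "head_of_household", "qualifying_widow_er",
}
PERSON_ROLES = {"primary", "spouse", "dependent"}
DEDUCTION_METHODS = {"standard", "itemized", "unknown"}
ITEMIZED_FIELDS = [
    "itemized_medical_expenses_deducted",
    "itemized_state_and_local_taxes_salt",
    "itemized_home_mortgage_interest",
    "itemized_investment_interest",
    "itemized_gifts_to_charity",
    "total_itemized_deductions",
]


def parse_states(value: str) -> List[str]:
    """Split the comma-separated state list, tolerating spaces and empty input."""
    return [s.strip().upper() for s in value.split(",") if s.strip()]


def check_record(record: Dict[str, Any]) -> List[str]:
    """Return human-readable findings; an empty list means nothing was flagged."""
    problems = []
    if record.get("filing_status") not in FILING_STATUSES:
        problems.append("filing_status is not an allowed value")
    for state in parse_states(record.get("state_returns_included") or ""):
        if len(state) != 2 or not state.isalpha():
            problems.append(f"state abbreviation looks malformed: {state!r}")
    for person in record.get("persons") or []:
        role = person.get("person_role")
        if role not in PERSON_ROLES:
            problems.append(f"person_role is not an allowed value: {role!r}")
    deductions = record.get("deductions") or {}
    method = deductions.get("deduction_method")
    if method not in DEDUCTION_METHODS:
        problems.append("deduction_method is not an allowed value")
    elif method == "standard":
        # Itemized fields are expected to be null or zero in this case.
        for name in ITEMIZED_FIELDS:
            if deductions.get(name):
                problems.append(f"{name} is set although deduction_method is standard")
    elif method == "itemized" and deductions.get("total_itemized_deductions") is None:
        problems.append("total_itemized_deductions is missing although deduction_method is itemized")
    return problems


# Illustrative records only; the values are made up.
clean = {
    "filing_status": "single",
    "state_returns_included": "CA, NY,NC",
    "persons": [{"name": "Jane Doe", "ssn": "1234", "person_role": "primary"}],
    "deductions": {"deduction_method": "standard",
                   "standard_deduction_amount": 13850.0,
                   "total_itemized_deductions": 0},
}
suspect = {
    "filing_status": "joint",
    "state_returns_included": "CA,New York",
    "deductions": {"deduction_method": "standard",
                   "itemized_gifts_to_charity": 500.0},
}
for finding in check_record(suspect):
    print(finding)
```

The check only reads values that are already present in the record and computes no monetary amounts, which matches the note that values are taken directly from the forms.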