We are a group of analysts and developers on the cutting edge of baseball research and technology.  Our data, insights, visualizations, and applications are used by Astros' players, coaches, and personnel to make better in-game decisions, acquire and develop the best talent, and win championships.We are an agile team that adapts to changing priorities throughout the baseball calendar.  We emphasize rapid and iterative development and deployment across a wide variety of platforms to support different business needs.  We are not married to any one technology or stack, but instead encourage prototyping and exploration in new technologies as often as possible.

Houston Astros Baseball R&D

Houston, TX

Data Engineer, Research & Development

The Houston Astros are seeking a Baseball Systems Data Engineer for the team’s Baseball Research and Development group. The Data Engineer will join a cross-functional, agile team of analysts and developers, and will build and maintain systems that promote the use and understanding of analytical baseball data and information throughout Baseball Operations. The Data Engineer will be responsible for collecting, processing, storing, and integrating many sources of baseball data, as well as designing and building new data solutions, both on-premises and in the cloud.

Essential Duties & Responsibilities:

  • Create, maintain, and optimize ETL jobs for incoming data feeds
  • Develop data quality assurance tools to ensure data integrity and system performance
  • Design, implement, and maintain data mapping procedures
  • Maintain and support internal database solutions
  • Actively participate with software developers and architects in design reviews, code reviews, and other best practices
  • Work closely with baseball analysts to design and implement data solutions
  • Respond to and resolve technical problems and issues in a timely manner

Education and/or Experience:

  • Bachelor’s degree in computer science or related field is preferred
  • Experience with SQL Server and T-SQL
  • Experience with database maintenance and DevOps is preferred
  • Experience building data solutions using Python, C#, or other languages is preferred, including using REST and other APIs to load data from external sources
  • Experience with Microsoft Azure/Amazon Web Services/Google Cloud platform is preferred
  • Experience with non-SQL data management solutions is preferred
  • Experience working with baseball data (e.g., TrackMan, Statcast) is a plus
  • Strong analytical and problem-solving skills
  • Strong interpersonal and communication skills (written and verbal)

Work Environment

This job operates in an office setting. This role routinely uses standard office equipment such as computers, phones, photocopiers, and filing cabinets.  The noise level is usually moderate but can be loud within the stadium environment.

Physical Demands

The physical demands described here are representative of those that must be met by an employee to successfully perform the essential functions of this job. This is a largely sedentary role; however, some filing is required. This would require the ability to lift files, open filing cabinets and bend or stand on a stool as necessary.

Position Type and Expected Hours of Work

Ability to work a flexible schedule, including evenings, weekends, and holidays.


Rare travel may be expected in this role.

Other Duties

Please note this job description is not designed to cover or contain a comprehensive listing of activities, duties or responsibilities that are required of the employee for this job. Duties, responsibilities and activities may change at any time with or without notice.

We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, or any other characteristic protected by law.