Data Science Applications to Astronomy

Syllabus

(version v0.2)

Basic Information

  • Course: Astro 416: Data Science Applications to Astronomy

  • Semester: (Spring 2025)

  • Class Meetings: 10:10-11:00am MWF

  • Location: Davey Lab 538

  • Instructor: Eric Ford

  • Email: ebf11 _at_ psu.edu

  • Graduate Teaching Assistant: Andrew Pellegrino

  • Email: axp1175 _at_ psu.edu

  • Office Hours: TBD

Course Overview

Students will build practical data science skills (e.g., querying astronomical databases, data storage and manipulation, data visualization, exploratory and explanatory data analysis, Bayesian modeling workflows, and reproducible research practices) and apply these lessons to analyzing data from astronomical surveys.

Course Goals

Successful students in the class will:

  • Increase their data acumen, and

  • Appreciate how building data science skills can benefit astronomy & astrophysics research.

Learning Objectives

Successful students in the class will:

  • Ingest and manipulate data from astronomical surveys.

  • Build, apply, assess and update astrophysically motivated models for astronomical observations.

  • Create visualizations for exploratory and explanatory data analyses of observations from astronomical surveys.

  • Synthesize the above into a dashboard to support the efficient analysis of astronomical observations.

  • Incorporate principles of reproducible research into their class project.

Course Content & Structure

In a typical week:

  • Monday lectures will introduce foundational Data Science knowledge.

  • Wednesday computer labs give students a chance to apply that knowledge.

  • Friday discussions will focus on practical challenges for astronomical applications.

Students will continue working on the computer lab begun on Wednesday as homework, typically due prior to class the following Monday. Inevitably, there will be some deviations (e.g., getting started week, weeks with a holiday or exam, week of student presentations, etc.).

Students will submit at least one questions about the week's content prior to Friday class to help inform the Friday discussion.

Schedule of Topics

WeekData Science
1What is Data Science?
2Exploratory Data Analysis
3Model Building
4Model Assessment
5Data Wrangling
6Data Storage
7Exam week
8Regularizaton & Cross-Validation
SBSpring Break
9Classification
10Data Visualization
11Reports & Dashboards
12Data Science Workflow
13Reproducible Research
14Retrospective & Student Presentations
15Student Presentations


The schedule is subject to change. Any changes will be announced via Canvas.

Expected Student Preparation

This class assumes that students have knowledge of calculus-based mechanics and astronomical methods and experience with basic programming concepts (e.g., data types, arrays, functions, loops and conditional statements), but not with numerical methods or statistical theory. Homework/lab assignments will make use of the Julia programming language, but no prior experience with Julia is expected. During the first half of the semester, examples or starter code will be provided so that students can focus on concepts rather than implementation details. During the second half of the semester, students will create a dashboard for analyzing and visualizing data from an astronomical survey. This will require synthesizing astronomy knowledge with Data Science skills built throughout the semester.


Formally, the prerequisites for the class are:

  • ( ASTRO 291 OR (ASTRO 401 AND ASTRO 402W) ) AND

  • ( ASTRO 21 OR CMPSC 121 OR CMPSC 131 OR CMPSC 201 ) AND

  • ( MATH 230 OR MATH 231) AND

  • ( PHYS 211 )

Any student who is interested in the class and has not completed the above, should consult with the instructor before starting the class. For example, students with programming experience from Astro 97 are encouraged to participate, but must submit a prerequisite override course in LionPATH to be allowed to register.

Relation to Other Courses

Data Science represents the synthesis of knowledge and skills from mathematics, statistics, computer science, and an application domain, as well several supporting skills (e.g., workflows, visualization, data ethics, communications, teamwork). Even students majoring in Data Sciences at PSU will not take classes covering all the important Data Science topics! This class is designed to complement ASTRO 410, 415 and 451 (and classes beyond the Astronomy department, e.g. MATH 220 and DS 310), so as to provide astronomy majors an introduction to data science skills that are particularly relevant to astronomical research and can be readily applied to other disciplines.


Since this class does not presume ASTRO 410, 415 or 451 as a prerequisite, there will be some modest overlap (i.e., essentials to fit a model to data). While ASTRO 451 provides a broad overview of techniques for conducting astronomical observations, this class will go into more detail about the methods for analyzing data that has been collected as part of an astronomical survey. While ASTRO 415 will provide a more rigorous treatment of statistical foundations, this class will emphasize applying such techniques and developing a broader set of data science skills. While ASTRO 410 will teach students how to implement numerical methods common in astrophysics research, this class will apply numerical methods that have already been implemented for students. The instructor will point out connections to other courses where students could learn more about a topic.

Required Course Materials

Textbooks

While there will be required readings, we will primarily use open online sources, so that students do not need to purchase a physical textbook. Formally, the required textbooks for this course are:

Additional resources

Links to additional readings and other online resources will be provided via Canvas. Students will also need a computer, modern web browser and high-speed internet access, so students can effectively complete computing assignments and participate in any Zoom classes.

Assessment and Grading Policy

Assessment will be in four categories:

  • Lab/Homework Assignments: 36%

  • Exam/Quizzes: 9%

  • Class Project: 45%

    • Project Plan: 5%

    • Checkpoints: 10%

    • Dashboard: 20%

    • Presentation: 5%

    • Reflection: 5%

  • Class Participation: 10%

    • Tophat Questions: 5%

    • Lab Participation: 5%

Lab/Homework Assignments

Most weeks, students will have a chance to begin work on a computer lab/homework assignment in small groups. Homework exercises will typically be due before the start of class on Monday. Any deviations from this schedule will be announced via Canvas. If the University is closed on the due date of an assignment (due to holiday or bad weather or any other reason), then the assignment will be due by 10am before the next class session (that is not canceled). There will be a 10% penalty on assignments submitted after the deadline and up to one week after deadline, and a 20% penalty on assignments submitted more than one week and up to two weeks after the deadline, and a 30% penalty on assignments submitted more than 2 weeks after the deadline.

Examination/Quiz Policy

A midterm exam will be taken during one of our regular class times. The only two conditions under which you can request a makeup exam: (1) If you know in advance that you will have to miss a midterm exam for a religious observance or university sponsored trip (e.g., athletics, research field work, class field trip), then you must request a makeup midterm in advance of the exam and as early in the semester as possible, to facilitate scheduling. (2) If you are injured or ill or in isolation or quarantine during the midterm exam, you should request a makeup exam as soon as practical. If any of the above cases, documentation may be requested. Please email the instructor as soon as you are able to request scheduling a makeup exam. The timing and format of a makeup midterm exam will depend on the circumstances and be at the instructor's discretion. For a religious holiday, the instructor would likely recommend taking the same exam early. For an extended illness, the instructor would likely recommend taking an alternative exam at the end of the class to substitute for the mid-term exam. Possible alternative exam formats include (but are not limited to) short answer, essay, or oral exam.

The intent is that most students will not take a final exam. However, the final exam week may be used in unusual circumstances (e.g., in place of the mid-term exam due to an extended illness, need an alternative project format due to accessibility issues).

There maybe short unannounced in-class quizzes during any class session. Students should let the instructor know in advance (whenever possible) if they will be unable to attend class due to illness, religious holiday, or university-approved travel, so they can be excused from in-class quizes that day.

Class Project

Students will synthesize lessons learned in the class by building a “dashboard” that ingests astronomical data, performs basic data manipulations, fits a model to the data, assesses the quality of the model for the given observations, and effectively visualizes the results. This is a substantive project and students should spread their effort over several weeks of the semester. To encourage making steady progress, students will earn credit for submitting a plan and demonstrating significant progress for two checkpoints. Students are encouraged to work in small teams of two to three, so that they can build a high-quality dashboard. Groups will submit a single dashboard and present their dashboard to the full class during the final weeks of class as a group. Each student will individually submit a final statement describing their contributions to the dashboard project, describing the contributions of their teammates, and reflecting on what they learned from the experience. Remember that both you and other group members will have other assignments, exams, and projects. Therefore, it is very important to develop a mutually agreeable schedule and to follow through on your contributions in a timely fashion.

Tophap Questions

All students should submit at least one question per week (e.g., a specific question to clarify something that was unclear to them from that week's class or reading) via TopHat no later 9am Eastern Time on Friday. There is also a link to the course TopHat site inside the Canvas webpage. Submitting your questions well before class starts is important, so the instructor will have time to read the questions and update the day’s lesson to respond to student questions. You’re strongly encouraged to take a look at questions submitted by other students and give a “thumbs up” to indicate those questions that you’d like to be addressed in class. Since reading questions can not be made up, each student’s three lowest tophat question scores will be dropped when computing final grades. Even if you're unable to attend a class, in most cases you can still submit a tophat question to earn credit. Students who are reluctant to ask questions in class are especially encouraged to ask extra questions prior to class via TopHat.

Some weeks, students will be assigned a short reading (or video) prior to class. These weeks, students will also be asked to respond to short reading questions via TopHat no later 9am Eastern Time on day that the reading is to be completed by. Reading assignments will appear in the lesson pages of the website.

Lab Participation

In-class computing lab sessions will provide an opportunity to work on lab/homework assignments. Students should participate regularly, so as to help solidify their understanding of topics from class and to build practical data science skills. Gaining experience communicating technical information and working as a team is an important part of the course, so are encouraged to work in small groups. Attendance will be taken during selected classes (particularly the lab sessions Wednesdays and/or Friday) for the lab participation grade. If the student is not present when attendance is taken, then they can earn a maximum of half a point for that week’s class participation (even if they arrive later or were participating previously that day). If you know you need to miss class (e.g., university-approved travel, health issues, isolation or quarantine), then let the instructor know in advance whenever practical, so the absence can be removed from your average.

Safety

While attendance and participation in class is important to the class and your learning, it is more important that we all stay safe and healthy. All students must follow all health and safety protocols required or recommended by the university. University policies and recommendations may change during the semester. The most up-to-date information can be accessed at https://virusinfo.psu.edu/university-status/.

As of January 2025, the university policy is:

Stay home and away from others if you are experiencing fever or respiratory symptoms such as but not limited to cough, sore throat, runny nose, chills, fatigue, headache, body aches. Return to normal activities when, for at least 24 hours, both are true:

  • Your symptoms are getting better overall, AND

  • You have not had a fever (and are not using fever-reducing medication)

Then, take these additional precautions for the next five days to limit the spread of infection:

  • Wear a well-fitting mask

  • Keep a distance from others and/or

  • Get tested to inform your actions to prevent the spread to others

If you begin feeling worse and/or fever returns, stay home and away from others for at least 24 hours until both are true:

  • Your symptoms are getting better overall, and

  • You have not had a fever (and are not using fever-reducing medication)

If you are unable to attend class, then you can still earn full credit for tophat questions and by submitting questions via tophat for that week. Students should make plans to get a classmate’s notes for any missed class sessions. Some class sessions may be moved online, based on community conditions or if the instructor needs to quarantine or isolate.

Academic Integrity

All Penn State and Eberly College of Science policies regarding academic integrity, ethics and honorable behavior apply to this course. In light of the fact that group work is highly encouraged, and to fully facilitate best ethical practices and academic integrity, the following rules apply:

The intellectual content of all submitted assignments should be the student's own work and not the output of AI tool. Using an AI-based grammar checker is acceptible and does not need to be disclosed for the labs or project in this course. Otherwise, students must fully disclose any and all use of artificial intelligence (AI) in completing their assignemnts at the time of submission. Students may receive reduced credit for assignments where AI tools were used. If you're unsure what's appropriate, then ask in advance of submission. Limited AI Use

All work submitted for an exam must be entirely the student's own work. For other assignments (i.e., homework/labs and class project), all ideas and work derived from resources beyond class notes must properly acknowledge or reference sources including: assigned readings (including textbooks and online sources), websites, classmates, other students, and solution sets from other or prior courses, etc. This means you should work together on labs/homework assignments, but each student should respond to questions individually and make liberal use of acknowledgments. For the class project, students will be encouraged to work in small teams of two or three students. In the final report, students are required to describe their contributions to the project accurately and to give credit to their teammates for their contributions.

Academic integrity is the pursuit of scholarly activity in an open, honest and responsible manner. Academic integrity is a basic guiding principle for all academic activity at The Pennsylvania State University, and all members of the University community are expected to act in accordance with this principle. Consistent with this expectation, the University’s Code of Conduct states that all students should act with personal integrity, respect other students’ dignity, rights and property, and help create and maintain an environment in which all can succeed through the fruits of their efforts.

Academic integrity includes a commitment by all members of the University community not to engage in or tolerate acts of falsification, misrepresentation or deception. Such acts of dishonesty violate the fundamental ethical principles of the University community and compromise the worth of work completed by others.

Recordings of classes

Some classes may be recorded. Ay students who prefer to not ask questions while being recorded are encouraged to submit questions in advance of class.

Video and audio recordings of classes are part of the class activities. Any video and audio recordings are used for educational use/purposes and only may be made available to all students presently enrolled in the class. For purposes where the recordings will be used in future class sessions/lectures beyond this class, any type of identifying information will be adequately removed.

According to University Policy, students must get express permission from their instructor to record class sessions. Screenshots showing instructors and students are considered recordings. Even if permission is granted, student-initiated recordings must be used only for educational purposes for the students enrolled in the initiating student’s class. Recordings may be used only during the period in which the student is enrolled in the class. Authorized student-initiated recordings may not be posted or shared in any fashion outside of the class, including online or through other media, without the express written consent of the course instructor or appropriate University administrator. Students who engage in the unauthorized distribution of class recordings may be held in violation of the University’s Code of Conduct, and/or liable under Federal and State laws.

Instructions for a campus closure or other adjustment

In the event of any changes to the schedule (e.g., due to a campus closure or delayed start, instructor illness, etc.), any changes in class meeting times, class format (in-person or Zoom), assignment deadlines, submission procedures, exam procedures, or any other necessary instructions will be communicated via an announcement in Canvas. Students should make a habit of checking their Canvas inbox at least daily.

Code of Mutual Respect and Cooperation

The Eberly College of Science Code of Mutual Respect and Cooperation embodies the values that we hope our faculty, staff, and students possess and will endorse to make The Eberly College of Science a place where every individual feels respected and valued, as well as challenged and rewarded. Please visit the link to review the 12 points that comprise this code.

Academic Support

The Eberly College of Science is committed to the academic success of students enrolled in the College's courses and undergraduate programs. When in need of help, students can utilize various College and University wide resources for learning assistance. https://science.psu.edu/current-students/support-network.

Disability Accommodation Statement

Penn State welcomes students with disabilities into the University’s educational programs. Every Penn State campus has an office for students with disabilities. Student Disability Resources (SDR) website provides contact information for every Penn State campus (http://equity.psu.edu/sdr/disability-coordinator). For further information, please visit Student Disability Resources website (http://equity.psu.edu/sdr/).

In order to receive consideration for reasonable accommodations, you must contact the appropriate disability services office at the campus where you are officially enrolled, participate in an intake interview, and provide documentation: See documentation guidelines (http://equity.psu.edu/sdr/guidelines). If the documentation supports your request for reasonable accommodations, your campus disability services office will provide you with an accommodation letter. Please share this letter with your instructors and discuss the accommodations with them as early as possible. You must follow this process for every semester that you request accommodations.

Regardless of whether you have a documented disability, please feel free to let the instructor know if you have a suggestion for how an assignment's accessiblity could be improved.

Counseling & Psychological Services Statement

Many students at Penn State face personal challenges or have psychological needs that may interfere with their academic progress, social development, or emotional wellbeing. The university offers a variety of confidential services to help you through difficult times, including individual and group counseling, crisis intervention, consultations, online chats, and mental health screenings. These services are provided by staff who welcome all students and embrace a philosophy respectful of clients’ cultural and religious backgrounds, and sensitive to differences in race, ability, gender identity and sexual orientation.

  • Counseling and Psychological Services at University Park (CAPS): http://studentaffairs.psu.edu/counseling/, 814-863-0395

  • Penn State Crisis Line (24 hours/7 days/week): 877-229-6400

  • Crisis Text Line (24 hours/7 days/week): Text LIONS to 741741

Reporting Educational Equity Concerns

Penn State takes great pride to foster a diverse and inclusive environment for students, faculty, and staff. Acts of intolerance, discrimination, or harassment due to age, ancestry, color, disability, gender, gender identity, national origin, race, religious belief, sexual orientation, or veteran status are not tolerated and can be reported through Educational Equity via the Report Bias webpage (http://equity.psu.edu/reportbias/).