Description
A major portion of this course will be an in-depth exploration of computational humanities tools and algorithms.
You and a partner will be researching and applying one such tool to a dataset of your choice. This
will involve a proposal, three presentations, a paper, and code.
Sample Project Topics
This list is by no means exhaustive of possible topics. Please use it as inspiration for
your own exploration of the concepts in this course.
Proposal 10%
The first step in your project is developing a proposal. This two-page document will be
due at the beginning of class on March 1st. There are three required sections of your proposal.
Research Question 4%
Most importantly, you must clearly state a research question you wish to address through
your project. Introduce your topic and its relevance to the course material.
Data Set 4%
Every project must involve the analysis of data. Identify the corpus of data you will be
using for your analysis. Who gathered and organized this data? How large is the data (number
of individual files/documents/images, total file size?)
Academic References 2%
Include references to sources you will use to inform your analysis. At least one source
should be an academic paper, published in a peer-reviewed journal or conference.
Presentations 20%
You will give three presentations about your project.
First Presentation 5%
Presentation Schedule
You will be giving a 4-5 minute overview of your project to the rest of the class.
You should focus on the three main elements of the proposal above, clearly explaining
the research question, your selected dataset, and the supporting references you will
use.
Second Update 5%
Your presentation will be 3 minutes, covering two pieces of your project, one slide
for each:
First, you will overview the algorithmic details involved. Is your approach to answer your
question rule-based, probabilistic, or a mixture of the two? Does it involve machine learning
algorithms? How is your data represented and processed?
Second, you will describe a graph/image/sound-file of results based on your research. As you will be in the
middle of your project, this could be an aspirational graph (IF everything works correctly,
these are the results I hope to get). Clearly identify your axes, use colors and
labels to quickly convey your information.
To ensure that these presentations all fit within the April 10th class period, I will need
you to submit your slides to me via Moodle by April 9th at 11:55pm. I will assemble them
all into one large presentation.
Final Presentation 10%
Finally, you will give a 10 minute in-class presentation. Time and material should be
split evenly between all project partners. Your objective in the presentation is to
fill us in on the main ideas of how you approached and analyzed your topic. Leave at
least one (1) minute for questions at the end of your presentation.
Practice is strongly encouraged prior to your actual presentation. The best way to be
comfortable with presenting in front of an audience is to practice. You will be graded
on both the organization and the presentation of your talk. The 10 minute time-limit
will be strictly enforced to ensure equal time for all presentations.
Paper and Code 20%
Summary Paper 15%
Your paper must be at least 12 pages. Your paper should be readable by a layperson who
is not familiar with your tools and algorithms but has a basic understanding of computer
science; essentially, write the paper for yourself before you took this class, based on
the prereqs of CSCI 150: Foundation of CSCI. You should include the following sections
- Introduction
- Background
- Data
- Analysis
- Conclusion
- Future Work
- References
Be sure to make good use of figures and graphs to demonstrate your results, use proper
grammar and spelling, and use proper citations.
Data and Implementation 5%
Finally include any code and IPython documents along with your dataset. Your notebook should
be well-organized including headers and Markdown blocks to denote the different
sections of your analysis.
© Mark Goadrich, Hendrix College