An article in the Journal of Statistics Education describes the development of a repository for real-world datasets associated with actual peer-reviewed publications. You have to fill out a short form agreeing not to do bad things with the data before downloading any of these datasets. Each dataset has a nice data dictionary.
- Cleveland Clinic Statistical Dataset Repository. Available in HTML format.
An earlier version of this page was published on new.pmean.com.