I S K O

Subject (of documents)

by Birger Hjørland

Table of contents:
1. Introduction
2. Theoretical views
    2.1a Charles Ammi Cutter (1837-1903)
    2.1b Melvil Dewey (1851-1931)
    2.2 S. R. Ranganathan (1892-1972)
    2.3 Patrick Wilson (1927-2003)
    2.4 "Content oriented" versus "request oriented" views
    2.5 Issues of subjectivity and objectivity
    2.6 The subject knowledge view
    2.7 Other views and definitions
3. Related concepts
    3.1 Words versus concepts versus subjects
    3.2 Aboutness
    3.3 Topic
    3.4 Isness
    3.5 Ofness
    3.6 Theme
    3.7 Content
4. Conclusion
Acknowledgments
References
Colophon
Abstract:
This article presents and discusses the concept "subject" or subject matter (of documents) as it has been examined in library and information science (LIS) for more than 100 years. Different theoretical positions are outlined, and it is found that the most important distinction is between document-oriented views and request-oriented views. The document-oriented view conceives of subject as something inherent in documents, whereas the request-oriented view (or the policy-based view) understands subject as an attribution made to documents in order to facilitate certain uses of them. Related notions such as concepts, aboutness, topic, isness, ofness and content are also briefly presented. The conclusion is that the most fruitful way of defining "subject" (of a document) is as the document's informative or epistemological potentials, that is, the document's potentials to inform users and advance the development of knowledge.

1. Introduction

In → library and information science (LIS), → documents (such as books, articles and pictures) are classified, indexed and searched by subject (as well as by other attributes such as author, → genre and language). This makes "subject" a fundamental concept in this field (see Golub 2014 for a recent text). This use of "subject" in LIS is part of a broader use of the concept, which applies to all kinds of utterances ("what is he talking about?"). LIS specialists assign subject labels to documents to make them findable and retrievable. Such professionally assigned subject labels compete with other → subject access points such as words from titles, abstracts and full text, bibliographic references and user → tagging. Therefore, research in subject representation is not limited to professionally assigned subject labels but includes the study of all possible subject access points.

There are many ways to produce subject representations, and there is often no consensus about which subject should be attributed to a given document. As stated by Lancaster (2003, 21), it is important to distinguish between the conceptual analysis and the translation stages in indexing and classification. In the conceptual analysis stage, subjects are attributed to documents, and in the translation stage, subject labels are assigned to documents. There tends to be great variation among indexers and classifiers in subject analysis and choice of subject labels, as measured, for example, by so-called inter-indexer consistency studies (see Saracevic 2008). To optimize subject representation and searching, we need a deeper understanding of the following questions:

What is the criterion for attributing a given subject to a given document?
What is to be understood by the statement 'document A belongs to subject category X'?
What is a subject?

This issue has been debated in the field for more than 100 years, often by using other terms such as aboutness or topic (see below).
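The variation among indexers mentioned above can be made concrete with a small calculation. The following is only a minimal sketch, assuming that consistency between two indexers is expressed as a set-overlap ratio (shared labels divided by all distinct labels used by either indexer). This is one simple way to quantify agreement and is not necessarily the measure used in the studies cited here. The function name and the subject labels are hypothetical.

```python
def indexer_consistency(terms_a, terms_b):
    """Share of distinct subject labels that both indexers assigned.

    Assumption: agreement is measured as a set-overlap (Jaccard) ratio.
    """
    a, b = set(terms_a), set(terms_b)
    if not a and not b:
        return 1.0  # two empty assignments agree trivially
    return len(a & b) / len(a | b)


# Hypothetical labels assigned by two indexers to the same document.
indexer_1 = ["classification", "indexing", "subject analysis"]
indexer_2 = ["indexing", "subject analysis", "aboutness", "retrieval"]

score = indexer_consistency(indexer_1, indexer_2)
print(f"{score:.2f}")  # 2 shared labels out of 5 distinct labels -> 0.40
```

A low score under such a measure only shows that the indexers disagree. It does not by itself say which attribution of subject is the better one, which is why the questions above need a theoretical answer.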

One may think that the concept "subject" in this connection is self-evident and in no need of theoretical exploration. The claim of this article is, however, that it is a basic concept with different meanings and that a fruitful understanding of it is of fundamental importance for LIS. What Tredinnick wrote about the concepts information, knowledge, data, document and text is equally true for subject:

The difficulty in reaching agreement about their meaning in part derives from the kinds of research questions that are addressed, but also in part from fundamental differences in the conceptual outlooks into which they are slotted. Implicit in this is an ongoing cycle of appropriation and reappropriation of the meaning of these contested terms for particular ends. (Tredinnick 2006, 19)

Therefore, we have to consider the different theoretical outlooks in order to decide which outlook, and thereby which understanding of "subject", is most fruitful for knowledge organization.

Encyclopedia of Knowledge Organization
