Content analysis is the study of documents and communication artifacts, which might be texts of various formats, pictures, audio or video. Social scientists use content analysis to examine patterns in communication in a replicable and systematic manner.[1] One of the key advantages of using content analysis to analyse social phenomena is their non-invasive nature, in contrast to simulating social experiences or collecting survey answers.
Practices and philosophies of content analysis vary between academic disciplines. They all involve systematic reading or observation of texts or artifacts which are assigned labels (sometimes called codes) to indicate the presence of interesting, meaningful pieces of content.[2][3] By systematically labeling the content of a set of texts, researchers can analyse patterns of content quantitatively using statistical methods, or use qualitative methods to analyse meanings of content within texts.
Computers are increasingly used in content analysis to automate the labeling (or coding) of documents. Simple computational techniques can provide descriptive data such as word frequencies and document lengths. Machine learning classifiers can greatly increase the number of texts that can be labeled, but the scientific utility of doing so is a matter of debate. Further, numerous computer-aided text analysis (CATA) computer programs are available that analyze text for predetermined linguistic, semantic, and psychological characteristics.[4]
Goals
Content analysis is best understood as a broad family of techniques. Effective researchers choose techniques that best help them answer their substantive questions. That said, according to Klaus Krippendorff, six questions must be addressed in every content analysis:[5]
Which data are analyzed?
How are the data defined?
From what population are data drawn?
What is the relevant context?
What are the boundaries of the analysis?
What is to be measured?
The simplest and most objective form of content analysis considers unambiguous characteristics of the text such as word frequencies, the page area taken by a newspaper column, or the duration of a radio or television program. Analysis of simple word frequencies is limited because the meaning of a word depends on surrounding text. Key Word In Context (KWIC) routines address this by placing words in their textual context. This helps resolve ambiguities such as those introduced by synonyms and homonyms.
A further step in analysis is the distinction between dictionary-based (quantitative) approaches and qualitative approaches. Dictionary-based approaches set up a list of categories derived from the frequency list of words and control the distribution of words and their respective categories over the texts. While methods in quantitative content analysis in this way transform observations of found categories into quantitative statistical data, the qualitative content analysis focuses more on the intentionality and its implications. There are strong parallels between qualitative content analysis and thematic analysis.[6]
Qualitative and quantitative content analysis
Quantitative content analysis highlights frequency counts and statistical analysis of these coded frequencies.[7] Additionally, quantitative content analysis begins with a framed hypothesis with coding decided on before the analysis begins. These coding categories are strictly relevant to the researcher's hypothesis. Quantitative analysis also takes a deductive approach.[8] Examples of content-analytical variables and constructs can be found, for example, in the open-access database DOCA. This database compiles, systematizes, and evaluates relevant content-analytical variables of communication and political science research areas and topics.
Siegfried Kracauer provides a critique of quantitative analysis, asserting that it oversimplifies complex communications in order to be more reliable. On the other hand, qualitative analysis deals with the intricacies of latent interpretations, whereas quantitative has a focus on manifest meanings. He also acknowledges an "overlap" of qualitative and quantitative content analysis.[7] Patterns are looked at more closely in qualitative analysis, and based on the latent meanings that the researcher may find, the course of the research could be changed. It is inductive and begins with open research questions, as opposed to a hypothesis.[8]
Codebooks
The data collection instrument used in content analysis is the codebook or coding scheme. In qualitative content analysis the codebook is constructed and improved during coding, while in quantitative content analysis the codebook needs to be developed and pretested for reliability and validity before coding.[4] The codebook includes detailed instructions for human coders plus clear definitions of the respective concepts or variables to be coded plus the assigned values.
According to current standards of good scientific practice, each content analysis study should provide their codebook in the appendix or as supplementary material so that reproducibility of the study is ensured. On the Open Science Framework (OSF) server of the Center for Open Science a lot of codebooks of content analysis studies are freely available via search for "codebook".
Furthermore, the Database of Variables for Content Analysis (DOCA) provides an open access archive of pretested variables and established codebooks for content analyses.[9] Measures from the archive can be adopted in future studies to ensure the use of high-quality and comparable instruments. DOCA covers, among others, measures for the content analysis of fictional media and entertainment (e.g., measures for sexualization in video games[10]), of user-generated media content (e.g., measures for online hate speech[11]), and of news media and journalism (e.g., measures for stock photo use in press reporting on child sexual abuse,[12] and measures of personalization in election campaign coverage[13]).
Computational tools
With the rise of common computing facilities like PCs, computer-based methods of analysis are growing in popularity.[14][15][16] Answers to open ended questions, newspaper articles, political party manifestos, medical records or systematic observations in experiments can all be subject to systematic analysis of textual data.
By having contents of communication available in form of machine readable texts, the input is analyzed for frequencies and coded into categories for building up inferences.
Computer-assisted analysis can help with large, electronic data sets by cutting out time and eliminating the need for multiple human coders to establish inter-coder reliability. However, human coders can still be employed for content analysis, as they are often more able to pick out nuanced and latent meanings in text. A study found that human coders were able to evaluate a broader range and make inferences based on latent meanings.[17]
Reliability and Validity
Robert Weber notes: "To make valid inferences from the text, it is important that the classification procedure be reliable in the sense of being consistent: Different people should code the same text in the same way".[18] The validity, inter-coder reliability and intra-coder reliability are subject to intense methodological research efforts over long years.[5]
Neuendorf suggests that when human coders are used in content analysis at least two independent coders should be used. Reliability of human coding is often measured using a statistical measure of inter-coder reliability or "the amount of agreement or correspondence among two or more coders".[4] Lacy and Riffe identify the measurement of inter-coder reliability as a strength of quantitative content analysis, arguing that, if content analysts do not measure inter-coder reliability, their data are no more reliable than the subjective impressions of a single reader.[19]
According to today's reporting standards, quantitative content analyses should be published with complete codebooks and for all variables or measures in the codebook the appropriate inter-coder or inter-rater reliability coefficients should be reported based on empirical pre-tests.[4][20][21] Furthermore, the validity of all variables or measures in the codebook must be ensured. This can be achieved through the use of established measures that have proven their validity in earlier studies. Also, the content validity of the measures can be checked by experts from the field who scrutinize and then approve or correct coding instructions, definitions and examples in the codebook.
Kinds of text
There are five types of texts in content analysis:
Content analysis is research using the categorization and classification of speech, written text, interviews, images, or other forms of communication. In its beginnings, using the first newspapers at the end of the 19th century, analysis was done manually by measuring the number of columns given a subject. The approach can also be traced back to a university student studying patterns in Shakespeare's literature in 1893.[22]
Over the years, content analysis has been applied to a variety of scopes. Hermeneutics and philology have long used content analysis to interpret sacred and profane texts and, in many cases, to attribute texts' authorship and authenticity.[3][5]
In recent times, particularly with the advent of mass communication, content analysis has known an increasing use to deeply analyze and understand media content and media logic.
The political scientist Harold Lasswell formulated the core questions of content analysis in its early-mid 20th-century mainstream version: "Who says what, to whom, why, to what extent and with what effect?".[23] The strong emphasis for a quantitative approach started up by Lasswell was finally carried out by another "father" of content analysis, Bernard Berelson, who proposed a definition of content analysis which, from this point of view, is emblematic: "a research technique for the objective, systematic and quantitative description of the manifest content of communication".[24]
Quantitative content analysis has enjoyed a renewed popularity in recent years thanks to technological advances and fruitful application in of mass communication and personal communication research. Content analysis of textual big data produced by new media, particularly social media and mobile devices has become popular. These approaches take a simplified view of language that ignores the complexity of semiosis, the process by which meaning is formed out of language. Quantitative content analysts have been criticized for limiting the scope of content analysis to simple counting, and for applying the measurement methodologies of the natural sciences without reflecting critically on their appropriateness to social science.[25] Conversely, qualitative content analysts have been criticized for being insufficiently systematic and too impressionistic.[25] Krippendorff argues that quantitative and qualitative approaches to content analysis tend to overlap, and that there can be no generalisable conclusion as to which approach is superior.[25]
Content analysis can also be described as studying traces, which are documents from past times, and artifacts, which are non-linguistic documents. Texts are understood to be produced by communication processes in a broad sense of that phrase—often gaining mean through abduction.[3][26]
Latent and manifest content
Manifest content is readily understandable at its face value. Its meaning is direct. Latent content is not as overt, and requires interpretation to uncover the meaning or implication.[27]
Uses
Holsti groups fifteen uses of content analysis into three basic categories:[28]
describe and make inferences about characteristics of a communication
make inferences about the effects of a communication.
He also places these uses into the context of the basic communication paradigm.
The following table shows fifteen uses of content analysis in terms of their general purpose, element of the communication paradigm to which they apply, and the general question they are intended to answer.
Uses of Content Analysis by Purpose, Communication Element, and Question
Purpose
Element
Question
Use
Make inferences about the antecedents of communications
Note. Purpose, communication element, & question from Holsti.[28] Uses primarily from Berelson[29] as adapted by Holsti.[28]
As a counterpoint, there are limits to the scope of use for the procedures that characterize content analysis. In particular, if access to the goal of analysis can be obtained by direct means without material interference, then direct measurement techniques yield better data.[30] Thus, while content analysis attempts to quantifiably describe communications whose features are primarily categorical——limited usually to a nominal or ordinal scale——via selected conceptual units (the unitization) which are assigned values (the categorization) for enumeration while monitoring intercoder reliability, if instead the target quantity manifestly is already directly measurable——typically on an interval or ratio scale——especially a continuous physical quantity, then such targets usually are not listed among those needing the "subjective" selections and formulations of content analysis.[31][32][33][34][35][36][20][37] For example (from mixed research and clinical application), as medical images communicate diagnostic features to physicians, neuroimaging's stroke (infarct) volume scale called ASPECTS is unitized as 10 qualitatively delineated (unequal) brain regions in the middle cerebral artery territory, which it categorizes as being at least partly versus not at all infarcted in order to enumerate the latter, with published series often assessing intercoder reliability by Cohen's kappa. The foregoing italicized operations impose the uncredited form of content analysis onto an estimation of infarct extent, which instead is easily enough and more accurately measured as a volume directly on the images.[38][39] ("Accuracy ... is the highest form of reliability."[40]) The concomitant clinical assessment, however, by the National Institutes of Health Stroke Scale (NIHSS) or the modified Rankin Scale (mRS), retains the necessary form of content analysis. Recognizing potential limits of content analysis across the contents of language and images alike, Klaus Krippendorff affirms that "comprehen[sion] ... may ... not conform at all to the process of classification and/or counting by which most content analyses proceed,"[41] suggesting that content analysis might materially distort a message.
Developing the initial coding scheme
The process of the initial coding scheme or approach to coding is contingent on the particular content analysis approach selected. Through a directed content analysis, the scholars draft a preliminary coding scheme from pre-existing theory or assumptions. While with the conventional content analysis approach, the initial coding scheme developed from the data.
Conventional process of coding
With either approach above, researchers may immerse themselves into the data to obtain an overall picture. A consistent and clear unit of coding is vital, with the choices ranging from a single word to several paragraphs and from texts to iconic symbols. Lastly, researchers construct the relationships between codes by sorting out them within specific categories or themes.[42]
^ abKracauer, Siegfried (1952). "The Challenge of Qualitative Content Analysis". Public Opinion Quarterly. 16 (4, Special Issue on International Communications Research): 631. doi:10.1086/266427. ISSN0033-362X.
^Conway, Mike (March 2006). "The Subjective Precision of Computers: A Methodological Comparison with Human Coding in Content Analysis". Journalism & Mass Communication Quarterly. 83 (1): 186–200. doi:10.1177/107769900608300112. ISSN1077-6990. S2CID143292050.
^Lacy, Stephen R; Riffe, Daniel (1993). "Sins of Omission and Commission in Mass Communication Quantitative Research". Journalism & Mass Communication Quarterly. 70 (1): 126–132. doi:10.1177/107769909307000114. S2CID144076335.
^Lasswell, Harold (1948). "The Structure and Function of Communication in Society". In Bryson, L. (ed.). The Communication of Ideas(PDF). New York: Harper and Row. p. 216.
^Berelson, B. (1952). Content Analysis in Communication Research. Glencoe: Free Press. p. 18.
^Jang-Hwan Lee; Young-Gul Kim; Sung-Ho Yu (2001). "Stage model for knowledge management". Proceedings of the 34th Annual Hawaii International Conference on System Sciences. IEEE Comput. Soc. p. 10. doi:10.1109/hicss.2001.927103. ISBN0-7695-0981-9. S2CID34182315.
^ abcHolsti, Ole R. (1969). Content Analysis for the Social Sciences and Humanities. Reading, MA: Addison-Wesley. pp. 14–93. (Table 2-1, page 26).
^Berelson, Bernard (1952). Content Analysis in Communication Research. Glencoe, Ill: Free Press.
^Holsti, Ole R. (1969). Content Analysis for the Social Sciences and Humanities. Reading, MA: Addison-Wesley. pp. 15–16.
^Holsti, Ole R. (1969). Content Analysis for the Social Sciences and Humanities. Reading, MA: Addison-Wesley.
^Neuendorf, Kimberly A. (2002). The Content Analysis Guidebook. Thousand Oaks, CA: Sage. pp. 52–54. ISBN0761919783. (On content analysis's descriptive role).
^Agresti, Alan (2002). Categorical Data Analysis (2nd ed.). Hoboken, NJ: Wiley. pp. 2–4. ISBN0471360937. (On the meanings of "categorical" and other measurement scales).
^Hall, Calvin S.; Van de Castle, Robert L. (1966). The Content Analysis of Dreams. New York: Appleton-Century-Crofts. pp. 1–16. (Chapter 1, "The Methodology of Content Analysis," on the quantitative nature and uses of content analysis, and quoting "subjective" from page 12).
Roberts, Carl, ed. (1997). Text Analysis for the Social Sciences: Methods for Drawing Inferences from Texts and Transcripts. Mahwah, NJ: Lawrence Erlbaum.
Wimmer, Roger; Dominick, Joseph (2005). Mass Media Research: An Introduction (8th ed.). Belmont, CA: Wadsworth.
Massadio Haïdara Haïdara bermain untuk Mali pada 2022Informasi pribadiTanggal lahir 2 Desember 1992 (umur 31)Tempat lahir Trappes, PrancisTinggi 179 cm (5 ft 10 in)[1]Posisi bermain Bek kiriInformasi klubKlub saat ini LensNomor 21Karier junior2002–2003 La Verrière2003–2005 Versailles2005–2008 Boulogne-Billancourt2008–2010 NancyKarier senior*Tahun Tim Tampil (Gol)2010–2013 Nancy 46 (0)2013–2018 Newcastle United 39 (0)2018– Lens 133 (4)Tim nasional�...
American indie rock band Wye OakWye Oak in 2014Background informationOriginBaltimore, MarylandGenresIndie rockindie folkdream popYears active2006–presentLabelsMerge, City SlangMembers Andy Stack Jenn Wasner William Joseph Hackney Websitewww.wyeoakmusic.com Wye Oak is an American indie rock duo from Baltimore composed of Andy Stack (drums, keyboards, backup vocals) and Jenn Wasner (vocals, guitars and bass).[1] Their sound has been described as earnest folk-influenced indie rock with...
Bushido, 2018 Bushidos Logo Bushido (* 28. September 1978 in Bonn als Anis Mohamed Youssef Ferchichi), auch bekannt unter dem Pseudonym Sonny Black, ist ein deutscher Rapper aus Berlin-Tempelhof, dessen Stil sich an den US-amerikanischen Gangsta-Rap anlehnt. Er arbeitet auch als Produzent und ist Inhaber des Plattenlabels Ersguterjunge. Inhaltsverzeichnis 1 Leben 1.1 Frühe Jugend 1.2 Karriere 1.2.1 Anfänge und Aggro-Berlin-Zeit (1998–2003) 1.2.2 Ersguterjunge-Gründung und Aufstieg zum Ma...
Cet article est une ébauche concernant l’Ukraine. Vous pouvez partager vos connaissances en l’améliorant (comment ?) selon les recommandations des projets correspondants. Raïon de Podil (uk) Подільський район Héraldique Drapeau Administration Pays Ukraine Ville Kiev Indicatif téléphonique (+380) Histoire de la division Création 1921 Démographie Population 198 123 hab. (2016) Densité 5 820 hab./km2 Géographie Coordonnées 50° 28′...
Hauptportal des Sächsischen Ständehauses in Dresden, Sitz des Sächsischen Landtages in der Weimarer Republik Der Landtag des Freistaats Sachsen war das Landesparlament Sachsens zur Zeit der Weimarer Republik. Er konstituierte sich auf der Grundlage der am 26. Oktober 1920 verabschiedeten neuen Verfassung am 7. Dezember 1920 und wurde nach der Machtergreifung durch die Nationalsozialisten 1933 zum 30. Januar 1934 offiziell aufgelöst. Vorgänger waren die nach der Novemberrevolution am...
羅德島 USS Rhode Island BB-17羅德島號。攝於1907年5月。概觀艦種戰艦艦級維珍尼亞級製造廠霍河造船廠動工1902年5月1日下水1904年5月17日服役1906年2月19日退役1920年6月30日結局1923年11月1日出售拆解除籍1922年7月12日技术数据標準排水量設計:14,948噸全長整體:441呎3吋水線:435呎全寬水線:76呎3吋吃水23.8呎鍋爐12座功率設計:19,000軸馬力試航:23,089軸馬力最高速度19節試航:19.18節續...
2022 single by Dean LewisHow Do I Say GoodbyeSingle by Dean Lewisfrom the album The Hardest Love Released2 September 2022[1]Length2:43LabelIsland AustraliaUniversal AustraliaSongwriter(s)Dean LewisJon HumeProducer(s)Dean LewisJon HumeEd HollowayNick AtkinsonDean Lewis singles chronology Lost Without You (2022) How Do I Say Goodbye (2022) Music videoHow Do I Say Goodbye on YouTube How Do I Say Goodbye is a song by Australian singer and songwriter Dean Lewis. The song was released on 2 ...
جزء من سلسلةالمرأة في المجتمع مجتمع قطاع الأعمال التعليم القوى العاملة (مجالس الإدارة) السياسة (في أستراليا) الخدمة العسكرية للمرأة في التاريخ الحقوق القانونية التاريخ جائزة نوبل حقوق الحيوانات العلوم والتكنولوجيا الطب العلوم الهندسة الحوسبة الفنون والعلوم الإنسانية الأ
Satō Eisaku Satō Eisaku (jap. 佐藤 榮作; * 27. März 1901 im heutigen Tabuse, Yamaguchi; † 3. Juni 1975 in Tokio) war ein japanischer Politiker, der der liberaldemokratischen Partei angehörte. Er war von 1958 bis 1960 Finanzminister und von 1964 bis 1972 der 39. Premierminister Japans. In Anerkennung für Japans Beitritt zum Atomwaffensperrvertrag erhielt Satō 1974 als erster Asiate den Friedensnobelpreis. Von 1964 bis 1975 war er Präsident der International Karate Organization Kyo...
Helikopter Bell UH-1D di perang Vietnam 1966. Penerbangan Angkatan Darat adalah unit yang seringkali digambarkan sama dengan Angkatan Udara. Unit ini umumnya berbeda dengan Angkatan Udara, baik peran dan fungsinya.Sebelum pembentukan Penerbangan Angkatan Darat, banyak pasukan darat dari berbagai negara yang masih bergantung dengan Angkatan Udara, sehingga begitu pentingnya suatu unit penerbangan tersendiri dalam Angkatan Darat.Sejumlah negara telah membangun suatu unit penerbangan angkatan da...
Ship-based reconnaissance floatplane He 60 Heinkel He 60. Role Reconnaissance seaplaneType of aircraft Manufacturer Heinkel First flight Early 1933 Introduction 1933 Retired 1943 Primary users LuftwaffeKriegsmarine Number built 361 The Heinkel He 60 was a German single-engined biplane reconnaissance seaplane designed to be catapulted from Kriegsmarine (German navy) warships of the 1930s. Development and design The Heinkel He 60 was designed by Heinkel engineer Reinhold Mewes, the designe...
Kirwan State High School, 2017 Annual Report Kirwan State High School is a coeducational state school located in Kirwan, Townsville, Queensland, Australia, approximately 12 kilometres (7 mi) from the Townsville CBD. In 2017, it has 1997 students with 156 teachers (145 full-time equivalent) and 75 non-teaching staff (59 full-time equivalent),[1] making it the largest high school in North Queensland. It is located in the former city of Thuringowa.[2] History The school enro...
City in Minnesota, United States City in Minnesota, United StatesSkylineCityLocation of Skyline, MinnesotaCoordinates: 44°08′26″N 94°02′02″W / 44.14056°N 94.03389°W / 44.14056; -94.03389CountryUnited StatesStateMinnesotaCountyBlue EarthGovernment • TypeMayor - Council • MayorPaige Attarian[1]Area[2] • Total0.19 sq mi (0.50 km2) • Land0.19 sq mi (0.50 km2) •&...
American musician This biography of a living person relies too much on references to primary sources. Please help by adding secondary or tertiary sources. Contentious material about living persons that is unsourced or poorly sourced must be removed immediately, especially if potentially libelous or harmful.Find sources: Theo Logian – news · newspapers · books · scholar · JSTOR (December 2013) (Learn how and when to remove this template message) Theo Lo...
Untuk buku 1937 karya Mao Zedong, lihat Tentang Perang Gerilya. Perang Gerilya Berkas:Che Guevara Guerrilla Warfare.jpgPerang Gerilya, yang diterbitkan oleh Ocean Books (2006)PengarangChe GuevaraBahasaInggrisGenreSejarah, Taktik Militer, Ideologi RevolusionerPenerbitNew York: Monthly Review PressTanggal terbit1961Jenis mediaCetakHalaman175ISBNISBN 0-8032-7075-5 Perang Gerilya (Spanyol: La Guerra de Guerrillas) adalah sebuah buku karya revolusioner Marxis Che Guevara yang ditulis...
Belize terbagi kedalam 6 distrik, yakni: Distrik Distrik di Belize Distrik Ibu kota Area (km²) Area (sq. mi) Populasi (2010 census) Kode 2 huruf Kode 3 huruf Belize Belize City 4,204 1,623 89,247 BZ BZD Cayo San Ignacio 5,338 2,061 73,302 CY CYO Corozal Corozal Town 1,860 718 40,324 CZ CZL Orange Walk Orange Walk Town 4,737 1,829 45,419 OW OWK Stann Creek Dangriga 2,176 840 32,166 SC SCK Toledo Punta Gorda 4,649 1,795 30,538 TO TOL Lihat pula Belize Referensi Tentang Belize Diarsipkan 2...
Thai football club This article needs additional citations for verification. Please help improve this article by adding citations to reliable sources. Unsourced material may be challenged and removed.Find sources: Thailand Tobacco Monopoly F.C. – news · newspapers · books · scholar · JSTOR (October 2014) (Learn how and when to remove this template message) Football clubTTM FC ทีทีเอ็ม เอฟซีFull nameThailand Tobacco Monopoly Fo...
Historical term for northern and central Asia Not to be confused with Strait of Tartary or Tatarstan. Map of independent Tartary (in yellow) and Chinese Tartary (in violet), in 1806. Tartary (Latin: Tartaria; French: Tartarie; German: Tartarei; Russian: Тартария, romanized: Tartariya) or Tatary (Russian: Татария, romanized: Tatariya) was a blanket term used in Western European literature and cartography for a vast part of Asia bounded by the Caspian Sea, the Ural Moun...