資料新聞學

数据新闻(台湾或称资料新闻学)(英語:data journalism)是指透過對大量資料集進行分析與篩檢後來產出新聞報導(故事)的一種新聞處理程序。在資料新聞學中,我們常常會使用到網路上可自由取得的開放資料,然後使用開放原始碼軟體來處理分析[1]。資料新聞學希望能服務大眾、協助消費者、經理管理人、政治人物來了解固定出現的模式,並根據出現的現像擬定策略。因此,資料新聞學將會使新聞記者在社會上扮演新的角色。

定義

資料新聞報導的處理流程
資料新聞報導的處理流程

根據資訊架構師和多媒體新聞記者 Mirko Lorenz 的說法,資訊新聞學是一個包含了下列這些元素的完整 workflow (工作流程) :將資料純淨化、結構化來「深入資料」,挖掘特定資訊來「過濾資料」,再將資料「視覺化」以做出報導。[2]另外也可以將這個過處理過程擴充加入其他步驟,使其適用於個人層面或是更廣的公共層面。

資料新聞學訓練員暨作家Paul Bradshaw用一種類似的方式來描述這種資料導向的新聞工作:必須要能夠使用像是MySQL或是Python等資料處理軟體來「找到」資料;然後「訊問」它,也就是要能夠理解當中的術語以及統計學;最後藉由開放原始碼工具將其「視覺化」及「混搭」。[3]

另外一個以結果導向來定義這個詞的資料記者暨網路趨勢研究者(web strategist)Henk van Ess (2012)[4]認為「資料導向的新聞工作使得記者能夠找到尚未被發現的事件,或是透過這套搜尋資料的流程來找到新的角度完成這份報導,也就是運用可行的開放原始碼工具對這些資料(可能是任何形式)加工並呈現出來。」Van Ess 認為一些資料導向的工作流程會使其產品「不在好敘事的範疇裡」,因為做出來的結果在於強調問題,而非闡述問題。「一個好的資料導向生產流程擁有不同的層面。它不只能夠讓你找到只對你重要,且個人化的內容,還能夠鑽到相關的細節裡讓你能夠廣覽全局。」


已隱藏部分未翻譯内容,歡迎參與翻譯

基於資料的新聞報導

Telling stories based on the data is the primary goal. The findings from data can be transformed into any form of journalistic writing. Visualizations can be used to create a clear understanding of a complex situation. Furthermore, elements of storytelling can be used to illustrate what the findings actually mean, from the perspective of someone who is affected by a development. This connection between data and story can be viewed as a "new arc" trying to span the gap between developments that are relevant, but poorly understood, to a story that is verifiable, trustworthy, relevant and easy to remember.

資料品質

In many investigations the data that can be found might have omissions or is misleading. As one layer of data-driven journalism a critical examination of the data quality is important. In other cases the data might not be public or is not in the right format for further analysis, e.g. is only available in a PDF. Here the process of data-driven journalism can turn into stories about data quality or refusals to provide the data by institutions. As the practice as a whole is in early development steps, examinations of data sources, data sets, data quality and data format are therefore an equally important part of this work.

資料新聞學和信任的力量

Based on the perspective of looking deeper into facts and drivers of events, there is a suggested change in media strategies: In this view the idea is to move "from attention to trust". The creation of attention, which has been a pillar of media business models has lost its relevance because reports of new events are often faster distributed via new platforms such as Twitter than through traditional media channels. On the other hand, trust can be understood as a scarce resource. While distributing information is much easier and faster via the web, the abundance of offerings creates costs to verify and check the content of any story create an opportunity. The view to transform media companies into trusted data hubs has been described in an article cross-published in February 2011 on Owni.eu[5] and Nieman Lab.[6]

資料新聞學的進行過程

The process to transform raw data into stories is aking to a refinement and transformation. The main goal is to extract information recipients can act upon. The task of a data journalist is to extract what is hidden. This approach can be applied to almost any context, such as finances, health, environment or other areas of public interest.

倒金字塔資料新聞學

In 2011, Paul Bradshaw introduced a model, he called "The Inverted Pyramid of Data Journalism"页面存档备份,存于互联网档案馆).

進行步驟

In order to achieve this, the process should be split up into several steps. While the steps leading to results can differ, a basic distinction can be made by looking at six phases:

  1. Find: Searching for data on the web
  2. Clean: Process to filter and transform data, preparation for visualization
  3. Visualize: Displaying the pattern, either as a static or animated visual
  4. Publish: Integrating the visuals, attaching data to stories
  5. Distribute: Enabling access on a variety of devices, such as the web, tablets and mobile
  6. Measure: Tracking usage of data stories over time and across the spectrum of uses.

步驟描述

尋找資料

Data can be obtained directly from governmental databases such as data.gov, data.gov.uk and World Bank Data API[7] but also by placing Freedom of Information requests to government agencies; some requests are made and aggregated on websites like the UK's What Do They Know. While there is a worldwide trend towards opening data, there are national differences as to what extend that information is freely available in usable formats. If the data is in a webpage, scrapers are used to generate a spreadsheet. Examples of scrapers are: ScraperWiki, Firefox plugin OutWit Hub or Needlebase (note: Needlebase will be retired June 1, 2012[8]). In other cases OCR-Software can be used to get data from PDFs.

Data can also be created by the public through crowd sourcing, as shown in March 2012 at the Datajournalism Conference in Hamburg by Henk van Ess [9]

資料清洗

Usually data is not in a format that is easy to visualize. Examples being that there are too many data points or that the rows and columns need to be sorted differently. Another issue is that once investigated many datasets need to be cleaned, structured and transformed. Various open source tools like Google Refine, Data Wrangler and Google Spreadsheets[10] allow uploading, extracting or formatting data.

資料視覺化

To visualize data in the form of graphs and charts, applications such as Many Eyes or Tableau Public are available. Yahoo! Pipes and Open Heat Map[11] are examples of tools that enable the creation of maps based on data spreadsheets. The number of options and platforms is expanding. Some new offerings provide options to search, display and embed data, an example being Timetric.[12]

To create meaningful and relevant visualizations, journalists use a growing number of tools. There are by now, several descriptions what to look for and how to do it. Most notable published articles are:

As of 2011, the use of HTML 5 libraries using the canvas tag is gaining in popularity. There are numerous libraries enabling to graph data in a growing variety of forms. One example here would be RGraph页面存档备份,存于互联网档案馆).[15] As of 2011 there is a growing list of JavaScript libraries页面存档备份,存于互联网档案馆) allowing to visualize data.

出版資料故事

There are different options to publish data and visualizations. A basic approach is to attach the data to single stories, similar to embedding web videos. More advanced concepts allow to create single dossiers, e.g. to display a number of visualizations, articles and links to the data on one page. Often such specials have to be coded individually, as many Content Management Systems are designed to display single posts based on the date of publication.

散佈資料

Providing access to existing data is another phase, which is gaining importance. Think of the sites as "marketplaces" (commercial or not), where datasets can be found easily by others. Especially of the insights for an article where gained from Open Data, journalists should provide a link to the data they used for others to investigate (potentially starting another cycle of interogation, leading to new insights).

Providing access to data and enabling groups to discuss what information could be extracted is the main idea behind Buzzdata,[16] a site using the concepts of social media such as sharing and following to create a community for data investigations.

Other platforms (which can be used both to gather or to distribute data):

評量以資料說故事的影響

A final step of the process is to measure how often a dataset or visualization is viewed.

In the context of data-driven journalism, the extent of such tracking, such as collecting user data or any other information that could be used for marketing reasons or other uses beyond the control of the user, should be viewed as problematic.Template:Says who One newer, non-intrusive option to measure usage is a lightweight tracker called PixelPing. The tracker is the result of a project by ProPublica and DocumentCloud.[20] There is a corresponding back-end solution to collect the data. The software is open source and can be downloaded via GitHub.[21]

實例

There is a growing list of examples how data-driven journalism can be applied:

Other prominent uses of data driven journalism is related to the release by whistle-blower organization WikiLeaks of the Afghan War Diary, a compendium of 91,000 secret military reports covering the war in Afghanistan from 2004 to 2010.[24] Three global broadsheets, namely The Guardian, The New York Times and Der Spiegel, dedicated extensive sections[25][26][27] to the documents; The Guardian's reporting included an interactive map pointing out the type, location and casualties caused by 16,000 IED attacks,[28] The New York Times published a selection of reports that permits rolling over underlined text to reveal explanations of military terms,[29] while Der Spiegel provided hybrid visualizations (containing both graphs and maps) on topics like the number deaths related to insurgent bomb attacks.[30]. For the Iraq War logs release, The Guardian used Google Fusion tables to create an interactive map of every incident where someone died[31], a technique it used again in the England riots of 2011.[32]

參見

外部連結

參考文獻

  1. ^ Lorenz, Mirko. Data driven journalism: What is there to learn?. Edited conference documentation, based on presentations of participants. 荷蘭阿姆斯特丹. 2010-08-24 [2012-11-18]. (原始内容存档于2019-06-09). 
  2. ^ Lorenz, Mirko. (2010). Data driven journalism: What is there to learn?页面存档备份,存于互联网档案馆) Presented at IJ-7 Innovation Journalism Conference, 7–9 June 2010, Stanford, CA
  3. ^ Bradshaw, Paul (1 October 2010). How to be a data journalist页面存档备份,存于互联网档案馆). The Guardian
  4. ^ van Ess, Henk. (2012). Gory details of data driven journalism页面存档备份,存于互联网档案馆
  5. ^ 存档副本. [2011-08-17]. (原始内容存档于2011-08-24). 
  6. ^ 存档副本. [2012-11-17]. (原始内容存档于2020-09-19). 
  7. ^ World Bank Data API. [2012-11-17]. (原始内容存档于2016-06-23). 
  8. ^ http://needlebase.com/页面存档备份,存于互联网档案馆) (accessed February 10, 2012)
  9. ^ 存档副本. [2012-11-17]. (原始内容存档于2021-02-25). 
  10. ^ 存档副本. [2012-11-17]. (原始内容存档于2010-04-21). 
  11. ^ 存档副本. [2012-11-17]. (原始内容存档于2012-11-23). 
  12. ^ 存档副本. [2012-11-17]. (原始内容存档于2019-01-31). 
  13. ^ 存档副本. [2012-11-17]. (原始内容存档于2011-08-22). 
  14. ^ 存档副本. [2012-11-17]. (原始内容存档于2014-09-20). 
  15. ^ 存档副本. [2012-11-17]. (原始内容存档于2021-04-22). 
  16. ^ 存档副本. [2011-08-17]. (原始内容存档于2011-08-12). 
  17. ^ 存档副本. [2012-11-17]. (原始内容存档于2021-04-13). 
  18. ^ 存档副本. [2012-11-17]. (原始内容存档于2019-12-22). 
  19. ^ 存档副本. [2021-05-18]. (原始内容存档于2019-01-31). 
  20. ^ 存档副本. [2012-11-17]. (原始内容存档于2016-12-21). 
  21. ^ 存档副本. [2012-11-17]. (原始内容存档于2020-11-22). 
  22. ^ Rogers, Simon (2011) http://www.guardian.co.uk/news/datablog/2011/jul/28/data-journalism页面存档备份,存于互联网档案馆
  23. ^ Evans, Lisa (2011) http://www.guardian.co.uk/news/datablog/2011/jan/27/data-store-office-for-national-statistics页面存档备份,存于互联网档案馆
  24. ^ Kabul War Diary页面存档备份,存于互联网档案馆), 26 July 2010, WikiLeaks
  25. ^ Afghanistan The War Logs页面存档备份,存于互联网档案馆), 26 July 2010, The Guardian
  26. ^ The War Logs页面存档备份,存于互联网档案馆), 26 July 2010 The New York Times
  27. ^ The Afghanistan Protocol: Explosive Leaks Provide Image of War from Those Fighting It页面存档备份,存于互联网档案馆), 26 July 2010, Der Spiegel
  28. ^ Afghanistan war logs: IED attacks on civilians, coalition and Afghan troops页面存档备份,存于互联网档案馆), 26 July 2010, The Guardian
  29. ^ Text From a Selection of the Secret Dispatches页面存档备份,存于互联网档案馆), 26 July 2010, The New York Times
  30. ^ Deathly Toll: Death as a result of insurgent bomb attacks页面存档备份,存于互联网档案馆), 26 July 2010, Der Spiegel
  31. ^ Wikileaks Iraq war logs: every death mapped页面存档备份,存于互联网档案馆), 22 October 2010, Guardian Datablog
  32. ^ UK riots: every verified incident - interactive map页面存档备份,存于互联网档案馆), 11 August 2011, Guardian Datablog

Read other articles:

Ini adalah nama Korea; marganya adalah Yang. Yang Se-chanLahir8 Desember 1986 (umur 37)Dongducheon, Korea SelatanMediaStand-up, televisiKebangsaanKorea SelatanTahun aktif2005–sekarangGenreObservasional, Sketsa, Wit, Parodi, Slapstick, Dramatik, SitkomNama KoreaHangul양세찬 Hanja梁世燦[1] Alih AksaraYang Se-chanMcCune–ReischauerYang Sech'an Yang Se-chan (lahir 8 Desember 1986), adalah seorang komedian Korea Selatan.[2][3] Saat ini Se-chan menjadi salah sa...

 

Keuskupan Valle de la PascuaDioecesis VallispaschalensisLokasiNegaraVenezuelaMetropolitCalabozoStatistikLuas37.900 km2 (14.600 sq mi)Populasi- Total- Katolik(per 2004)360.000352,000 (97.8%)InformasiRitusRitus LatinPendirian25 Juli 1992 (31 tahun lalu)Kepemimpinan kiniPausFransiskusUskupRamón José Aponte FernándezPeta Keuskupan Valle de la Pascua (Latin: Dioecesis Vallispaschalensiscode: la is deprecated ) adalah sebuah keuskupan yang terletak di kota Vall...

 

Untuk kegunaan lain, lihat Karangasem. Koordinat: 8°27′02″S 115°36′22″E / 8.450653°S 115.605973°E / -8.450653; 115.605973 KarangasemKecamatanPeta lokasi Kecamatan KarangasemNegara IndonesiaProvinsiBaliKabupatenKarangasemPemerintahan • CamatCokorda Alit Surya Prabawa, S.STP.[1]Populasi • Total97,584 jiwa (2.016)[2] 82,606 jiwa (2.010)[3] jiwaKode pos80811Kode Kemendagri51.07.04 Kode BPS5107040 Luas94,23 km...

Disambiguazione – Nisida rimanda qui. Se stai cercando altri significati, vedi Nisida (disambigua). NisidaGeografia fisicaLocalizzazioneGolfo di Napoli, Mar Tirreno Coordinate40°47′43″N 14°09′48″E / 40.795278°N 14.163333°E40.795278; 14.163333Coordinate: 40°47′43″N 14°09′48″E / 40.795278°N 14.163333°E40.795278; 14.163333 ArcipelagoIsole Flegree Superficie0,7 km² Altitudine massima109 m s.l.m. Geografia politicaStato&...

 

Kota pasar (Inggris: Market town atau market right) adalah istilah hukum, digunakan sejak abad pertengahan, untuk wilayah di Eropa yang memiliki hak untuk menggelar sebuah pasar, sebuah hak yang tidak dimiliki oleh desa dan kota. Sebuah kota kelurahan (town) dapat saja menggambarkan sebuah kota pasar atau setidaknya memiliki hak untuk menyelenggarakan pasar meskipun sudah tidak lagi memiliki pasar, yang menggambarkan istilah ini masih digunakan hingga sekarang. Padanan kata untuk kota pas...

 

U.S. House district for Ohio OH-16 redirects here. The term may also refer to Ohio State Route 16. Ohio's 16th congressional districtObsolete districtCreated1830Eliminated2020Years active1833–2023 The district from 2013 to 2023 The 16th congressional district of Ohio is an obsolete United States congressional district last represented by Representative Anthony Gonzalez (R). It was last located in the northeast of the state, covering Wayne County and with arms extending north into the suburb...

The Military ranks of Barbados are the military insignia used by the Barbados Defence Force. Commissioned officer ranks The rank insignia of commissioned officers. Rank group General / flag officers Senior officers Junior officers Officer cadet Barbados Regiment[1]vte Major general Brigadier general Colonel Lieutenant colonel Major Captain First lieutenant Second lieutenant Officer cadet  Barbados Coast Guard[1]vte Commodore Commander Lieutenant Commander Lieutenant Sub ...

 

Commune in Bourgogne-Franche-Comté, France Commune in Bourgogne-Franche-Comté, FranceLoulleCommuneThe church in LoulleLocation of Loulle LoulleShow map of FranceLoulleShow map of Bourgogne-Franche-ComtéCoordinates: 46°42′32″N 5°52′56″E / 46.7089°N 5.8822°E / 46.7089; 5.8822CountryFranceRegionBourgogne-Franche-ComtéDepartmentJuraArrondissementLons-le-SaunierCantonChampagnoleGovernment • Mayor (2020–2026) Xavier Racle[1]Area110....

 

2018 Indian drama film BhonsleFilm posterDirected byDevashish MakhijaWritten byDevashish MakhijaMirat TrivediSharanya RajgopalProduced byShabana Raza Bajpayee[1]Piiyush SinghAbhayanand SinghSaurabh GuptaSandiip KapoorStarringManoj BajpayeeSantosh JuvekarCinematographyJigmet WangchukEdited byShweta Venkat MathewMusic byMangesh DhakdeProductioncompaniesMuvizzManoj Bajpayee ProductionsDistributed bySonyLIVRelease dates 5 October 2018 (2018-10-05) (Busan) 26 June&#...

Artikel ini bukan mengenai Abdoel Moeis. Gusti Abdul Muis BiografiKelahiran12 April 1919 Kematian27 September 1992 (73 tahun)Tempat pemakamanKuburan Muslimin Banjarmasin Galat: Kedua parameter tahun harus terisi! Data pribadiAgamaIslam KegiatanPekerjaanulama Haji Gusti Abdul Muis (12 April 1919 – 27 September 1992) adalah seorang kiai dan politikus Indonesia. Dia dikenal sebagai tokoh Muhammadiyah di Kalimantan Selatan.[1] Kehidupan awal Dia lahir di Samarinda pada tah...

 

South Korean actress This biography of a living person needs additional citations for verification. Please help by adding reliable sources. Contentious material about living persons that is unsourced or poorly sourced must be removed immediately from the article and its talk page, especially if potentially libelous.Find sources: Jin Kyung – news · newspapers · books · scholar · JSTOR (September 2018) (Learn how and when to remove this message) In this ...

 

Questa voce sull'argomento calciatori italiani è solo un abbozzo. Contribuisci a migliorarla secondo le convenzioni di Wikipedia. Segui i suggerimenti del progetto di riferimento. Attilio ValobraNazionalità Italia Calcio RuoloCentrocampista Termine carriera1924 CarrieraSquadre di club1 1909-1910 Juventus II3 (1)1910-1911 Juventus9 (0)1911-1914 Piemonte? (?)1914-1924 Torino46 (0) Nazionale 1913 Italia1 (0) 1 I due numeri indicano le presenze e le reti segnate, ...

1961 studio album by Jackie McLeanJackie's BagStudio album by Jackie McLeanReleasedJune 1961RecordedJanuary 18, 1959–September 1, 1960StudioVan Gelder Studio, Hackensack, New Jersey; Van Gelder Studio, Englewood Cliffs, NJGenreJazzLength38:50 LP 62:47CDLabelBlue NoteProducerAlfred LionJackie McLean chronology Capuchin Swing(1960) Jackie's Bag(1961) Bluesnik(1962) Jackie's Bag is an album by American saxophonist Jackie McLean recorded in 1959 and 1960 and released by Blue Note.[1...

 

This X-ray film reveals a poor crown-to-root ratio for tooth #21 (right), the lower left first premolar. The tooth exhibits 50% bone loss, adding roughly 5-7 mm to the clinical crown of what is actually anatomical root. The fulcrum, existing somewhere immediately apical to the height of the bone, does not allow for any adjacent bone to avoid compression or tension, resulting in virtually complete widening of the PDL and a grim prognosis, due to secondary occlusal trauma. Crown-to-root-ratio i...

 

يفتقر محتوى هذه المقالة إلى الاستشهاد بمصادر. فضلاً، ساهم في تطوير هذه المقالة من خلال إضافة مصادر موثوق بها. أي معلومات غير موثقة يمكن التشكيك بها وإزالتها. (نوفمبر 2019) الدوري اليوغوسلافي الأول 1976–77 تفاصيل الموسم الدوري اليوغوسلافي الأول  النسخة 48  البلد يوغوسلافيا&#...

 Gran Premio d'Italia 1973 233º GP del Mondiale di Formula 1Gara 13 di 15 del Campionato 1973 Data 9 settembre 1973 Nome ufficiale XLIV Gran Premio d'Italia Luogo Monza Percorso 5.775 km Distanza 55 giri, 317.625 km Clima Soleggiato Risultati Pole position Giro più veloce Ronnie Peterson Jackie Stewart Lotus-Ford in 1:34.80 Tyrrell-Ford in 1:35.30 (nel giro 51) Podio 1. Ronnie PetersonLotus-Ford 2. Emerson FittipaldiLotus-Ford 3. Peter RevsonMcLaren-Ford Il Gran Premio d'Italia 1973, ...

 

English Bible translator (1342–1402) John TrevisaBornJohn Trevisa1342Trevessa, St. Enoder parish, EnglandDied1402Occupation(s)Theologian, writer, translator, vicar and canonEmployerQueen's College, Oxford Polychronicon Ranulphi Higdin, Monachi Cestrensis, 1865 John Trevisa (or John of Trevisa; Latin: Ioannes Trevisa; fl. 1342–1402 AD) was a Cornish writer and professional translator. Trevisa was born at Trevessa in the parish of St Enoder in mid-Cornwall, in Britain and was a native Corni...

 

Chỉ Nam xa tại Bảo tàng Khoa học Luân Đôn. Chỉ Nam xa (指南車) hay Tư nam xa (司南車) là một phát minh của người Trung Quốc cổ, đây là một cơ cấu truyền động bánh răng có dạng một chiếc xe hai bánh trên đó có một hình nhân luôn chỉ về hướng Nam bất kể hướng chuyển động của chiếc xe, nói cách khác đây là một hệ thống la bàn phi từ tính. Tương truyền Chỉ Nam xa được Hoàng Đế ho...

Catholic archdiocese in France Archdiocese of PoitiersArchidioecesis PictaviensisArchidiocèse de PoitiersPoitiers CathedralLocationCountryFranceEcclesiastical provincePoitiersStatisticsArea13,098 km2 (5,057 sq mi)Population- Total- Catholics(as of 2021)812,900 (est.)661,000 (est.) (81.3%)Parishes28InformationDenominationRoman CatholicSui iuris churchLatin ChurchRiteRoman RiteEstablished3rd Century(as Diocese)8 December 2002(as Archdiocese)CathedralCathedral Basi...

 

1923 1933 Élections générales espagnoles de 1931 28 juin 1931 Composition du Congrès des députés(par blocs) Gauche Républicains de gauche Gauche nationaliste catalane Fédération républicaine galicienne Républicains du centre et de droite Droite et centre nationalistes Droite Droite monarchiste Président du Conseil des ministres Sortant Élu Niceto Alcalá-Zamora DLR Manuel Azaña AR Législature élue Ire(14 juillet 1931 - 9 octobre 1933) modifier - modifier le code - voir Wikida...