Investigative journalism tools
Free software and open-source tools for journalists, journalistic research, discovery, investigative reporting, privacy, data visualization, data driven journalism and datajournalism
Investigative journalism and journalistic research
Sie sind hier
Startseite
Free software and open source tools for investigative journalism and journalistic research
Free software and open source tools for investigative journalism and journalistic research
Free software for journalists: Tutorials, bookmarks and open source tools for journalistic research, investigations and privacy and other digital tools for investigative journalism and data driven journalism or datajournalism:
Independent media tools for journalists and investigative reporting
With free open source software it is possible to run research tools for sensitive documents or data on your own computer or server instead of spying cloud services.
Tutorials and tips: How to use open source research tools for investigative journalism
Toolbox: Free software, open source tools and resources
Free software and open source discovery and research tools for journalists:
Search engines for fulltext search and discovery
Research methods, techniques and technology: Fulltext search, Information retrieval, Desktop Search, Enterprise Search and faceted search
Tutorials:
How to search, sort, explore and filter large document collections or many search results
How to use boolean search operators
Open source search tools:
Search libraries and APIs
If you want code yourself, you can use this powerful engines as base:
Solr: Index and search API
Elastic Search: Index and search API
Databases, digital archives, data management systems, document management systems and content management systems
Methods: Archive, database, forms, categories (tagging), classification, meta data, repository, document management (DMS), content management (CMS) or enterprise content management (ECM), knowledge management, knowledge base, bookmarks
Tagging and annotation
Methods: Annotation, Tagging, Social Tagging, Folxonomies
Tutorial: Tagging and annotation for collaborative investigative journalism
Zotero: Bookmark database and citations manager with tagging and annotation features
Docear: Bookmark database and citations manager with mindmap, tagging and annotation features
Document Cloud: Tagging and annotation for paper based documents like scans or PDF documents
Neonion: Collaborative annotations within text
Pundit: Annotations within text and within images
Hypothesis
Annotator.js
Text mining, text analysis and document mining
Method: Text mining, Natural Language Processing (NLP), Named entities extraction
Text mining tutorial: How to analyze large document collections: Text mining with the search engine Open Semantic Search
Understanding language data: Open-source NLP software can help
Overview project: Showing most used words and trees of most used words
Jigsaw: Text mining tool (not open source, but free download)
More:
Wikipedia list of open source text mining software
Tapor: Text Analysis Portal for Research
Reconcilation and merging
Methods: Compare, merge, reconcile, link, clustering
Fuzzy search with lists: Checks, if there are search result for each list entry
OpenRefine
DocDiff: Shows and visualize the differences between two versions of a text
Fslint: Compares two directories and searches for same files which are in both directories
Graphs and social network analysis (SNA)
Tools to analyze and visualize connections and relations:
Network analysis tutorial: How to visualize connections & relations in documents with Open Semantic Search
Gephi: Desktop tool for analysis and data visualization of networks, connections and graphs
Cytoscape.js: Javascript library for data visualization of networks, connections and graphs
Semantic Mediawiki: Very flexible CMS for linked data
Detective: Python/Django and neo4j graph database based CMS for connections
Privacy, security, safety and encryption
Digital security: Protect your research, sources and whistleblowers with privacy tools and encryption tools:
Methods: Encryption (PGP, OTR) and anonymization
Tutorials:
Surveillance self-defense: Tips, Tools and How-tos for Safer Online Communications
Security in a box
Encryption works
How to setup an search engine on an encrypted usb key or external harddrive
Information Security for Journalists
Open source tools:
Media monitoring, news filtering, news pipes and alerts
Open source software for media monitoring, news processing, news filtering and alerting:
Extract data or convert data
Methods: Data integration, extraction, data converter, data migration, ETL (Extract Transfer Load), Scraping
Extract text or structured data from documents
Documents: Tika content analysis toolkit: Extract text and meta data from documents of many different file formats
CSV tables: CSV Manager: Import big csv spreadsheets to Solr based search engines
PDF tables: Tabula: Extracts spreadsheets from PDF documents
Scans and images: Optical character regognition (OCR)
Extract text from images (OCR)
Tesseract: OCR Software to recognize text from images
Scantailor: Deskewing low quality scans
Extract text from sound files (speech recognition)
CMU Sphinx: Open source speech recognition toolkit
Extract structured data from websites (Scraping)
Portia: Extract structured data from websites by a visual user interface
Scrapy: Extract structured data from websites by Python scrapers
Extract transform load (ETL) Frameworks for import and transform or convert data
Transform to plain text: Tika content analysis toolkit
Apache NiFi: Extract, transform, load and distribute data
Talend Open Studio: Import and transform data to other formats
Kettle: Import and transform data to other formats
LogStash: Import and transform data from datasources like logfiles to an structured search index
Data visualization
Method: data visualization
Tools for data visualization or data visualisation:
Charts and diagrams
Datawrapper - Webapp and user interface for easy generating charts
HUE Solr search
Kibana for Elastic Search
Apache Zeppelin
Superset
Banana for Solr
NVD3: Javascript library for easy programming of charts with D3
Maps and mapping (spatial data)
Create interactive maps and visualize spatial data (geodata) with open source software for mapping:
Visualize events on a timeline
Create timelines with open source timeline tools and visualize events on interactive multimedia timelines:
Tutorial on timelines
TimelineJS
Simile Timeline
Odyssey.js: Combines a timeline with a map for timelines for spatial data
Graphs, networks, connections and relations
Network analysis tutorial: How to visualize connections & relations in documents with open semantic search
Gephi: Desktop tool for analysis and data visualization of networks, connections and graphs
Cytoscape.js: Javascript library for data visualization of networks, connections and graphs
Sigma js: Javascript library for data visualization of networks, connections and graphs
Redact documents and delete meta data
Clean sensitive documents and delete meta data stored invisible inside the document files or photos like serial numbers of hardware (i.e. of your photo camera) or software or user names:
PDF Redact Tools: Most secure way to delete meta data from PDFs
MAT: Metadata Anonymisation Toolkit: Userinterface to delete meta data from different document formats and image formats
Statistics and analytics
Method: Data analysis, statistics, chart, diagram, data visualization
Universal open source toolset
The ultimate universal open source toolset is a Linux distribution like Debian GNU/Linux or Ubuntu Linux comming with thousands of packages of free software and open source tools, software libraries and programming languages.
You dont have to remove your existing operating system: With open-source virtualization software like Virtual Box for Windows or Mac you can run a Linux distribution within a window in your existing operating system environment.
Maybe you want to start with Linux on your existing system environment with the preconfigurated Debian based virtual maschine (VM) Open Semantic Desktop Search providing a preselected and preconfigurated collection of tools for investigative journalists.
Subscribe
RSS-NewsfeedFacebookTwitter
Subscribe to our Newsfeed.
Investigative journalism tools
Search tools
Text analysis, text mining and document mining
Annotation
Databases and document management
Graphs and social network analysis
Privacy, security & encryption
Data visualization
News, monitoring and alerts
Datavisualization
Charts
Timelines
Mapping: Interactive maps
Networks, connections and relations
www.mandalka.name
www.mandalka.name