What can AWS tell us about fake and credible news media websites?

A presentation at AWS Community Day Germany 2019 in September 2019 in Hamburg, Germany by Bruno Amaro Almeida

Slide 1

Slide 1

What can AWS tell us about fake and credible news media websites? Bruno Amaro Almeida | 9 Sept 2019 @bruno_amaro Community Day 2019 Sponsors

Slide 2

Slide 2

FUTURE. CO-CREATED. Nordic Roots, Global Mindset PEOPLE NATIONALITIES 550+ 38 8 30% OFFICES Tampere Helsinki Oslo Stockholm YoY GROWTH Family of Companies eCommerce & Growth Hacking Berlin London Artificial Intelligence & Machine Learning Stuttgart Munich

Slide 3

Slide 3

Who is this guy? Principal Architect & Technology Advisor @ Futurice ! native, based in ” Cloud, DevOps, Security, Data Engineering & AI Reach out on: @bruno_amaro BERLIN · HELSIN K I · LON DON @brunoamaroalmeida · MUN ICH · OSLO · STOCK HOLM · TAMPERE

Slide 4

Slide 4

AI & Analytics Capabilities Data Engineering (ingest, prepare, transform, analyze) AI/ML Platform (build, train, deploy) AI/ML API’s (pre-trained models, serverless, out of the box) @bruno_amaro

Slide 5

Slide 5

AWS vs GCP vs Azure: Data Engineering / AI Ingest ETL • AWS Kinesis • AWS Glue / EMR • Google Pub/Sub • Google Dataflow / DataProc • Azure Event Hubs • Azure DataFactory / DataBricks Raw Storage • AWS S3 • Google Cloud Storage • Azure Data Lake Storage Data Warehouse • AWS Redshift • Google Cloud BigQuery • Azure SQL Data Warehouse Machine Learning • AWS SageMarker • Google Cloud Datalab Analytics / BI • AWS QuickSight • Google Cloud Data Studio • Azure ML Studio / Workbench • Power BI @bruno_amaro

Slide 6

Slide 6

AWS vs GCP vs Azure: AI/ML API’s AI/ML Service APIs AI/ML Service APIs AI/ML Service APIs • AWS Lex • Google Dialogflow • Azure Bot Service • AWS Rekognition • Google Vision API • Azure Vision • AWS Translate • Google Text-to-Speech API (ASR) • AWS Polly (TTS) • Google Speech-to-Text API • Azure Speech • Translator Speech API, Bing Speech API • AWS Transcribe (ASR) • Google Natural Language API (NPL) • AWS Textract (OCR) • Google Translation API • Azure Knowledge • AWS Comprehend (NPL) • Google Video Intelligence API • Azure Search • AWS Forecast (Time-series forecast) • Google Inference API (Time-series forecast) • Google Job Discovery • Bing News/Web/Image/Video/Custom Search • Azure Language • Google Cloud Genomics (Store and process genomes and related experiments ) Source: AWS • In preview: Speaker Recognition API, Custom Speech Service Source: Google Cloud • Language Understanding (LUIS), Bing Spell Check, Text Analytics, Translator Text API Source: Azure @bruno_amaro

Slide 7

Slide 7

News Media Websites (Fake vs Credible)

Slide 8

Slide 8

Website Metadata Extraction Methods xvfb-run (…) wkhtmltoimage (…) lynx —dump image-scraper Pressure on Theresa May to resign will ‘increase dramatically’ following extension, warns David Davis + Review set for June 21 after Macron opposes long delay + EU already talking about possibility of further extension + Britain’s EU ambassador formally accepts extension [51]ReconstructionHow Emmanuel Macron raged against Britain’s chaotic [52]Janet DaleyAny Brexit solution with Theresa May in post is impossible [53]How long does the PM have left after being forced to accept a six month (…) @bruno_amaro

Slide 9

Slide 9

AWS Services • Rekognition Enrich website metadata with AI/ML API • Comprehend @bruno_amaro

Slide 10

Slide 10

Enrich website metadata with AI/ML API @bruno_amaro

Slide 11

Slide 11

AWS Comprehend for Sentiment Detection

Slide 12

Slide 12

AWS Comprehend for Sentiment Detection

Slide 13

Slide 13

AWS Comprehend for Sentiment Detection

Slide 14

Slide 14

AWS Comprehend for Sentiment Detection

Slide 15

Slide 15

Scale, Aggregate, Profit vs BERLIN · HELSIN K I · LON DON · MUN ICH · OSLO · STOCK HOLM · TAMPERE Photo by Elijah O’Donnell on Unsplash

Slide 16

Slide 16

Interesting findings Text Categories / Entities (AWS) @bruno_amaro

Slide 17

Slide 17

Interesting findings Text Categories / Entities (Google Cloud) @bruno_amaro

Slide 18

Slide 18

Interesting findings Sentiment Analysis (AWS) @bruno_amaro

Slide 19

Slide 19

Interesting findings Sentiment Analysis (Google Cloud) @bruno_amaro

Slide 20

Slide 20

Interesting findings Image Recognition & Moderation Labels (AWS) @bruno_amaro

Slide 21

Slide 21

Interesting findings Image Recognition & Safe Search Annotation (Google Cloud) @bruno_amaro

Slide 22

Slide 22

Thank you! Kiitos! Danke! Tack! Bruno Almeida P RINC IP AL ARC HITE C T & TE C HNOL OGY ADV ISOR Cloud, Security, DevOps, Data Engineering & AI Reach out on: @bruno_amaro @brunoamaroalmeida BERLIN · HELSIN K I · LON DON · MUN ICH · OSLO · STOCK HOLM · TAMPERE