A presentation at AWS Community Day Germany 2019 in in Hamburg, Germany by Bruno Amaro Almeida
What can AWS tell us about fake and credible news media websites? Bruno Amaro Almeida | 9 Sept 2019 @bruno_amaro Community Day 2019 Sponsors
FUTURE. CO-CREATED. Nordic Roots, Global Mindset PEOPLE NATIONALITIES 550+ 38 8 30% OFFICES Tampere Helsinki Oslo Stockholm YoY GROWTH Family of Companies eCommerce & Growth Hacking Berlin London Artificial Intelligence & Machine Learning Stuttgart Munich
Who is this guy? Principal Architect & Technology Advisor @ Futurice ! native, based in ” Cloud, DevOps, Security, Data Engineering & AI Reach out on: @bruno_amaro BERLIN · HELSIN K I · LON DON @brunoamaroalmeida · MUN ICH · OSLO · STOCK HOLM · TAMPERE
AI & Analytics Capabilities Data Engineering (ingest, prepare, transform, analyze) AI/ML Platform (build, train, deploy) AI/ML API’s (pre-trained models, serverless, out of the box) @bruno_amaro
AWS vs GCP vs Azure: Data Engineering / AI Ingest ETL • AWS Kinesis • AWS Glue / EMR • Google Pub/Sub • Google Dataflow / DataProc • Azure Event Hubs • Azure DataFactory / DataBricks Raw Storage • AWS S3 • Google Cloud Storage • Azure Data Lake Storage Data Warehouse • AWS Redshift • Google Cloud BigQuery • Azure SQL Data Warehouse Machine Learning • AWS SageMarker • Google Cloud Datalab Analytics / BI • AWS QuickSight • Google Cloud Data Studio • Azure ML Studio / Workbench • Power BI @bruno_amaro
AWS vs GCP vs Azure: AI/ML API’s AI/ML Service APIs AI/ML Service APIs AI/ML Service APIs • AWS Lex • Google Dialogflow • Azure Bot Service • AWS Rekognition • Google Vision API • Azure Vision • AWS Translate • Google Text-to-Speech API (ASR) • AWS Polly (TTS) • Google Speech-to-Text API • Azure Speech • Translator Speech API, Bing Speech API • AWS Transcribe (ASR) • Google Natural Language API (NPL) • AWS Textract (OCR) • Google Translation API • Azure Knowledge • AWS Comprehend (NPL) • Google Video Intelligence API • Azure Search • AWS Forecast (Time-series forecast) • Google Inference API (Time-series forecast) • Google Job Discovery • Bing News/Web/Image/Video/Custom Search • Azure Language • Google Cloud Genomics (Store and process genomes and related experiments ) Source: AWS • In preview: Speaker Recognition API, Custom Speech Service Source: Google Cloud • Language Understanding (LUIS), Bing Spell Check, Text Analytics, Translator Text API Source: Azure @bruno_amaro
News Media Websites (Fake vs Credible)
Website Metadata Extraction Methods xvfb-run (…) wkhtmltoimage (…) lynx —dump image-scraper Pressure on Theresa May to resign will ‘increase dramatically’ following extension, warns David Davis + Review set for June 21 after Macron opposes long delay + EU already talking about possibility of further extension + Britain’s EU ambassador formally accepts extension [51]ReconstructionHow Emmanuel Macron raged against Britain’s chaotic [52]Janet DaleyAny Brexit solution with Theresa May in post is impossible [53]How long does the PM have left after being forced to accept a six month (…) @bruno_amaro
AWS Services • Rekognition Enrich website metadata with AI/ML API • Comprehend @bruno_amaro
Enrich website metadata with AI/ML API @bruno_amaro
AWS Comprehend for Sentiment Detection
AWS Comprehend for Sentiment Detection
AWS Comprehend for Sentiment Detection
AWS Comprehend for Sentiment Detection
Scale, Aggregate, Profit vs BERLIN · HELSIN K I · LON DON · MUN ICH · OSLO · STOCK HOLM · TAMPERE Photo by Elijah O’Donnell on Unsplash
Interesting findings Text Categories / Entities (AWS) @bruno_amaro
Interesting findings Text Categories / Entities (Google Cloud) @bruno_amaro
Interesting findings Sentiment Analysis (AWS) @bruno_amaro
Interesting findings Sentiment Analysis (Google Cloud) @bruno_amaro
Interesting findings Image Recognition & Moderation Labels (AWS) @bruno_amaro
Interesting findings Image Recognition & Safe Search Annotation (Google Cloud) @bruno_amaro
Thank you! Kiitos! Danke! Tack! Bruno Almeida P RINC IP AL ARC HITE C T & TE C HNOL OGY ADV ISOR Cloud, Security, DevOps, Data Engineering & AI Reach out on: @bruno_amaro @brunoamaroalmeida BERLIN · HELSIN K I · LON DON · MUN ICH · OSLO · STOCK HOLM · TAMPERE
What happens when you enrich metadata collected from news media with the AI/ML API’s (e.g. Image Classification, Translation, Sentiment Analysis) that AWS provides?
In this talk, we will see what kind of insights we gain when leveraging these ready-made Serverless AI/ML capabilities.
Here’s what was said about this presentation on social media.
What can AWS tell us about fake and credible media websites? @bruno_amaro on #awscommunityday - Thanks for the insights pic.twitter.com/rC00vTDVtF
— Tobias (@tweini) September 9, 2019
Next lightning talk at #AWSCommunity day, this time by @bruno_amaro on analysis for fake and credible news media websites using #aws pic.twitter.com/mDU9X84aqX
— Steffen Mazanek (@smazanek) September 9, 2019