All Posts

Data mining PII via optical character recognition on publicly hosted image sites pt. 1

Cybercrime is expected to reach $6 trillion in damages in 2021 (Herjavec Group, 2019), and has an average yearly impact to the global economy of $450 billion (Harper et al., 2018). Many instances of identity theft are the result of simple phishing campaigns, where a malicious actor tricks a target into entering their username and password into a fake version of a trusted website or downloading an attachment containing malware.

Text Messages Sentiment Analysis

Introduction Personal communications have always been a research-rich resource. Theodore Roosevelt’s journal and correspondence have been studied by historians, the FBI wants a backdoor into iPhones, even metadata about our phone calls are extremely valuable to the intelligence community. Sentiment analysis is a method used to study the subjectivity of collections of text. It can be used in a variety of applications from analyzing song lyrics, to product reviews, to assessing public opinion through trends on Twitter.

OCR and Text Mining pt. 1

The conversation started out simply enough at lunch one day: “Dude have you ever been to prnt.sc? There’s this tool called Lightshot that uploads screenshots and just indexes the URLs. You could easily write a script to loop through them.” I had done something similar several years ago in college when I wrote a script to randomly generate Imgur URLs for a drinking game. Cat was one drink, celebrities were three or something like that.

Set Up Raspberry Pi via SSH

Setting up a Raspberry Pi without a monitor, keyboard, or mouse can be intimidating until you know how. The following steps will get you up and running in no time using a Mac. What you need: Raspberry Pi Micro SD card for the Pi to boot from Ethernet cord Now that you have all the tools, let’s get the Pi running! Download Raspbian Download the latest version of Raspbian Buster Lite from the official site.

Simplify Test Cases

Writing and maintaining test cases per current industry standard is detrimental to the engineering effort of testing. A major component of this lack of realism stems from the belief that test cases must be written in such a way that anyone can execute them. The dream makes sense: in a crunch it would be great if the entire company could pitch in and execute tests. The beauty of this idea is skin deep at best.

API Testing with Node.js and Gitlab

TL;DR - clone this Having automated tests is a good thing, but having automated tests that run as part of your build process is even better. The following few paragraphs are going to illustrate how to get started down that path. First, a few essentials and a little about the tools that I’m using here. There are quite a few continuous integration tools out there, but for this example we are going with Gitlab.