🚨 Draft 0.0.2 🚨
Windfall is a planned software project.
Windfall scrapes/fetches data from news and social media sources and applies machine learning principles to predict stock market prices in real-time. It also seeks to provide users with a panel through which they can navigate this nuanced data and strategize to maximize their returns.
Windfall seeks to algorithmically examine the relationship - if any - between the news we consume and the way our financial markets respond in kind. Windfall doesn’t seek to “beat” similar software - and trust us, we know we’re not the first - but rather to approach the situation in its own unique way.
Continued research into the many relevant fields is being done before any official development takes place, both for literacy in the rigorous subject matter, and to help ensure that:
Windfall will be composed of a series of modules. The sections below lightly detail the modules that are planned so far, and provide an overview on how they will work together.
Cheers, here’s to making this pipe dream of an application into reality. 🍺
Below is a list of the 5 primary modules that will comprise Windfall’s back-end. In addition to these modules, there will be a dedicated front-end and a “site” front-end that will provide a simple user CRUD system.
This module is responsible for:
analyzer. Other data spaces are being explored.
Support for GraphQL is being considered as a way to retrieve data from the API, although it is uncertain whether this application will have a public API or be offered “as-is.”
This module is responsible for:
This mediator module is responsible for:
analyzersubmodules, which will each yield insight in different ways. In other words, it is the synchronous endpoint to which the following
analysis-*submodules will resolve.
The goal of this submodule is to produce a “score” for each individual article/post. The hypothesis is that this score, when multiplied by the weight associated with the article’s origin (as produced by the next module below), will allow for a more accurate score overall.
This submodule is responsible for:
The goal of this submodule is to produce coefficients (or “weights”) that describe the reliability of various news sources as predictors of stock performance.
This submodule is responsible for:
Finally, the goal of this module is to “stitch” all the information so far into a bundle of information that is easy to understand.
Below are some “User Stories” of how this module’s data might be used.
The above section detailed the modules involved. This section discusses some implementation-specific information, as distinct from the core Windfall Engine.
Sentiment Analysis and other NLP analytical processes will be applied to the text, ultimately outputting a “score action”. The term
action here is used in the context of Redux actions.
Inasmuch as possible, this operation will be done asynchronously, as per this documentation, to prepare Windfall as an “always-on, always-analyzing” engine that is minimally capped by low-spec hardware.
With an asynchronous, redux-based implementation, we should be able to amass an extremely navigable record of changes. By observing the patterns in these changes, we can prescribe a course of action in that taunting binary choice that commands the stock market: invest, or don’t?
Financial records are immaculately preserved, and serve as a potent resource for machine learning training. It is possible that by retroactively parsing news resources and pairing them semantically with news coverage patterns within the same units of time, we can synthesize an understanding of the following:
These two data points will allow us to hone our news parsing algorithm by assigning weights to data gleaned from Source A vs Source B, ultimately resulting in a far more accurate model over all.
ALTERNATIVELY, as a practical consideration and following the software principle that software should complement, rather than completely replace the element of human decision, we may simply opt to provide a GUI-friendly portal to this historical stock data instead of using it as a factor relevant to the scoring.
Details regarding the architecture for the site.
There will be several modules. A dedicated parser/scraper API, a server for users and back-end data, and a ReactJS front-end. The software is, ultimately, intended for use by people and not developers. Thus, the API will be internal use only.
A server running Flask will contain processes that are polling all data sources at some yet-undetermined interval.
cron or some daemon will be deployed to listen for new news stories and parse them upon release. This server will contain the code for the primary modules mentioned above.
A second server running Laravel or Echo with the GORM ORM will be deployed. This server will contain the code controlling user sign up and user-specific data, such as stocks to track, as well as interactively generate personalized financial projections.
The Laravel server will also “dumbly” serve static past stock historical data, using AJAX and a search engine like Apache Solr and D3 to allow for swift browsing of stock histories. These can just use Blade templates.
A third server will serve a React-powered front-end. This “admin panel” of sorts will allow you to view your timeline of promising stocks, track specific stocks, create manual mappings between stocks and keywords, etc.
There will be staging environments for each component. Jenkins CI will be used to build each application as necessary on EC2 instances.
nginx will be used.
Although the exact libraries used are obviously difficult to predict, some likely libraries will be:
Back-end libraries/frameworks will include Flask for the scraping API, either Laravel or Echo for the main web application back-end.
Front-end libraries will include Bulma, along with Google WebFonts and TypeKit. Webpack will be used to build, bundle, and optimize assets.
ReactJS will be used to power the “dashboard” or “admin panel” aspect. Redux will be used as the state container. ES6 may be used, although the Python versions of ES6-esque features (such as “promises” ) is being researched.
No idea. I’ll probably never finish.
Yes, I’m aware. It’s essentially a pipe dream.
A sacrosanct desire to leverage the immense computational power available to us to empower and equip the lower/middle class so that they might auspiciously compete in that global financial arena which has, for too long, been bloodied by the-
Sorry bro I just get so heated u know
I’ve been better. I’ve been getting caffeine headaches.
Oh, okay. Thank you.
No. The project doesn’t even exist in a tangible state yet. I’d like to have more than just the first draft of a theoretical overview before I ask people to join in.
No, but I know more than I did last year. And I’ll know more next year than I do now. And so it goes.
For now, just me. Later, anyone who wants to use it.
I’ve always wanted to be a baker.View All Posts