We are young, inexperienced, prone to fail: we’re 2 first time founders. I always prefer to learn from people who are just a few steps ahead of me.
If you’re a multi-time entrepreneur then you might not find this article interesting. If you’ve been thinking about starting your own startup then I’d recommend reading as many similar articles as possible.
Artem (author of this article):
When I was designing Elasticsearch index for NewsCatcherAPI, one of the biggest problems I had was handling multi-language news articles.
I knew that Elasticsearch has pre-build analyzers for the most popular languages. The question was “How do I manage to have documents with different languages that I can search all together (if needed)?”
Important: in our case, we had each document already labeled with the correct language. Still, it is not necessary for all approaches described in this post.
Also, for this post set up, let’s assume that each document (news article) has only 2 fields:
TL;DR README of your repository should show/present your code as if it was a product that you sell.
Disclaimer: it is my personal blog. What I describe here works(ed) for me. There is no guarantee that my approach is an ultimate guide that guarantees a 100% result. Also, trying to get N GitHub stars itself is a bad approach — I just want to help you understand how to make other people notice the work you have done already.
6 months ago when I and my friend were preparing to launch a closed beta of NewsCatcherAPI I had a simple…
In this article, I would like to share with you my little observations on how many junior developers failed to convince me to hire them by not putting a README to their repositories.
Useful links on how to craft a stunning README at the end of the article.
Usually, README is a file in the repository of software/project that briefly explains it.
README file is there to pitch your work.
README is not your documentation (unless it can be fitted into one page).
If you make your repository public, you most likely will be judged by it. …
UPD: The Python package I talk about in this article does not depend on any external API. You do not need to be our client to use it!
About two months ago, I saw a problem coming. My side-project newscatcherapi.com was soon to be ready to launch for a closed beta. But, we had 0 sign-ups in our email list.
For a little back story, Newscatcher is a Data-as-a-Service company that builds an API to search through online news articles. Just like Google searches the most relevant web pages, we return you the data on the most relevant news articles. …
In this post, I will go through my experience of developing, deploying and selling my API via an API marketplace. I did not have to set up a website or think about how to integrate payment processing solutions. I just wrote my code and deployed it.
Building a startup requires a team. A team of a few jacks of all trades: coders, marketing, sales. And, it is a long and exhausting path, therefore, low chances to succeed.
You do not have to launch a startup to begin your own thing. …
Within 60 days we:
Official website of the product that I will talk about in this article.
In this article:
We are a team of 2 data engineers. Within February and March 2020 we dedicated most of our spare time building an API that allows you to search for the news articles’ data.
It is like querying…
As I am writing this article, many people have to work from home, some have a lot of free time during this period. You can use this time to build your portfolio, enhance your skills or begin a side-project.
Newscatcher package makes it easy to collect and normalize news articles data without any external dependencies. It was built while we were working on our main Data-as-a-Service product called Newscatcher API. We are the developers-first team, therefore, we open-source as much as possible so that coders can partially replicate the job we have done for free.
The way to use our…
At Politwire, we monitor and analyze the media coverage of politicians. We help political campaigns, businesses and media houses to understand politicians’ media paths.
On March 3, 2020, fourteen US states held primaries. About one-third of all delegates were involved. As you might already know, Joe Biden took back the lead by taking around 100 delegates more than Bernie Sanders.
Even though Joe Biden ended up as a winner, did it give him a significant lead of media coverage?
In the graph above, we can see the number of articles for each candidate per each day of the Super…
If I had to describe Elasticsearch in one phrase I would say something like:
When search meets analytics at scale (in near real time)
Elasticsearch is in the top 10 most popular open-source technologies at the moment. Fair enough, it unites many crucial features that are not unique itself, however, it can make the best search engine/analytics platform when combined.
More precisely, Elasticsearch has become so popular due to a combination of the following features: