Koii Data Aggregator
Introductionβ
Thanks to the growth and adoption of AI, data on the web is now more precious than ever and community devices are the most efficient way to access it. Koii makes crowd sourcing this data more affordable and accessible to end-users, developers, and companies in need.
Web platforms are incentivized to restrict data, as API costs rise, so does the value of the data. In contrast, a decentralized data market like Koii becomes more affordable with increased competition.
One of our example template tasks is a Twitter Crawler that uses nodes to efficiently crawl Twitter Data. As an example, we have improved the crawler to function as a full-featured Twitter API alternative allowing users to collect data, automate Twitter replies, filter by engagement, and pull data from specific tweet IDs. However, this is just an example of how you can:
- Gather Data from any Website using Koii.
- Put that Information on a Database.
- Create an API that accesses that data.
Featuresβ
Koii Network's Twitter Crawler uses community devices to efficiently crawl tweets with speed and flexibility. While this doesnβt expose private information, it allows crowdsourced data projects to bypass the limitations of traditional Twitter API rate limits. Our solution provides data buyers with more granular control, allowing them to gather large amounts of data specific to their requirements.
Feature | Koii Data Aggregator |
---|---|
Working Twitter Search | βοΈ Yes |
No Proxy Required | βοΈ Yes |
Reply to Tweets | βοΈ Yes |
Follow Accounts | βοΈ Yes |
Tweets Per Hour | 2000 Tweets |
Search Data Range | Unlimited |
Advanced Search | βοΈ Yes |
Engagement Filter | βοΈ Yes |
Date Filter | βοΈ Yes |
Profile Filter | βοΈ Yes |
Link, Video, Image Filters | βοΈ Yes |
Exclude Retweets | βοΈ Yes |
- Cost Savings: Centralized API prices rise with data's value, while our decentralized approach becomes more affordable with more nodes, avoiding large Twitter API costs.
- Large Data: With Koii, you can gather more tweets more often. No rate-limits and ALL the data available NOT just what the API provides.
- Relevant Data: Only gather what you need. By setting your requirements, you ensure only relevant data is fetched, improving the efficiency of your compute tasks. Community nodes only receive incentives if the tweets they submit match your specifications (i.e. tweets since 09/01/2023, containing
"#elon_musk_is_cool"
) - Filter by Engagement: Users can filter tweets based on a threshold of likes, retweets, or replies, which the Twitter API does not support.
Support for Reddit and Facebook.
Data Structureβ
data = {
user_name: "username",
screen_name: "screen_name",
user_url: "user's URL",
user_img: "user's profile image URL",
tweets_id: "tweet ID",
tweets_url: "tweet URL",
tweets_content: "tweet text",
time_post: "time the tweet was posted",
time_read: "time the tweet was read",
comment: "number of comments",
like: "number of likes",
share: "number of shares",
view: "number of views",
outer_media_url: "array of media URLs",
outer_media_short_url: "array of short media URLs"
}