Likely the one and only method is to paywall the chapters, since aggregator owners won't find it worth paying a recurring fee just to get access to one novel.
Isn't it easy? The bots rely on the Novel Updates home page to function. Just set the release date on NovelUpdates to 10 days ago. @Tony should add anti-bot measures to the home page; instead of crawling every site, it's far more effective for them to crawl the NovelUpdates home page, where all the latest updates are listed. BTW, isn't a captcha against NovelUpdates' rule of no password-protected/locked content? Well, NovelUpdates kinda supports the aggregators, in my view.
Block the NUF homepage to bots and they'll just target translators' sites directly. This kind of thing is like playing catch: you can catch them once, but you can't catch them forever.
Well, with the number of new sites and the sheer volume of updates, it's more cost-effective and realistic. Novels are constantly being dropped and picked up by other groups, and there are also temporary sites like forums and such. Well, NovelUpdates kinda supports the aggregators with the no-Google-Docs policy and whatnot. Just change the upload date to 10 to 20 days ago if you don't want it stolen when you upload.
Three links on every page isn't actually necessary. One link is enough; you just need to make it so that it's not consistent. If the readers complain about clicking one or two links, well, that just means it's time to stop translating. Back in my day, readers would consciously click the ads just so they could support the translator. Nowadays everyone thinks they're entitled to free, easy stuff.

Ai-chan guesses you mean one of the two (sketches of both follow below):
1) norobots - which is bad for most translators because it pretty much means your site will never be ranked. Aggregator bots will ignore it anyway.
2) spider trap - which is fine if you already have norobots. You waste the aggregators' resources so that scraping you becomes extremely expensive. However, without norobots, legitimate bots like Google's will mark your site as malicious.
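For concreteness, here's a minimal sketch of the "norobots" half. Flask is my choice of framework, not something from this thread; any server that can return a robots.txt and set a meta tag works the same way. Disallowing everything is what keeps legitimate crawlers out entirely, and therefore keeps the site from ever ranking:

```python
# Minimal "norobots" sketch, assuming Flask (not anything thread-specific).
from flask import Flask, Response

app = Flask(__name__)

@app.route("/robots.txt")
def robots():
    # Google and other well-behaved crawlers obey this and stay out.
    # Aggregator bots typically ignore it, which is the whole problem.
    return Response("User-agent: *\nDisallow: /\n", mimetype="text/plain")

@app.route("/chapter/<int:num>")
def chapter(num: int):
    # Belt and braces: a noindex meta tag on every page as well.
    return (
        "<html><head><meta name='robots' content='noindex'></head>"
        f"<body>chapter {num} text here</body></html>"
    )

if __name__ == "__main__":
    app.run()
```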
It was 2-3 years ago. Because they provided direct links to chapters, they got a letter telling them to stop or be taken down. A dumb reason, but it worked.
It isn't a tease; I PM'd them. If the method is made public, it can be defeated. Keeping it secret and PMing people prevents it from being abused. If there is only one link, a scraper can easily search for links within a certain box and click them. Of course, you can add some invisible links to trick the scraper, but that only makes it partly harder. Close; think benefit of all without the downsides.
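For anyone unfamiliar with the invisible-link idea mentioned here, a sketch of what it could look like. This is purely hypothetical, not the PM'd method; the function name, decoy URL, and inline CSS are all made up:

```python
# Hypothetical invisible-decoy-link sketch (not the PM'd method).
# A human never sees the hidden <a>; a scraper that collects every link
# inside the content box follows it and outs itself.
def chapter_html(real_next_url: str) -> str:
    return f"""
    <div class="chapter-content">
      <p>...chapter text...</p>
      <a href="/decoy-page" style="display:none">next</a>
      <a href="{real_next_url}">Next Chapter</a>
    </div>
    """
```

Since no human can see or click the decoy, anything that requests /decoy-page is almost certainly a bot, so the server can log or block that visitor.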
It has no use; robots.txt is just something for search engines. It helps reduce the chance of appearing in search results, if the search engine respects your robots.txt. You can remove your sitemap all you want. Our site, Black Box Internet Service Translation, doesn't have a sitemap and has noindex in the meta of all pages, yet the bots scrape us as usual. Their method is easy: they use the latest-chapter-release feed powered by novelupdates dot com and go to the link. If the body of the HTML has more than 600 characters, they steal that page. If not, they check the first links from that page for a body with more than 600 characters of text. I always put unedited links before my edited links, and the bots fall for it.
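Spelled out in code, the heuristic described above looks roughly like this. The 600-character threshold comes straight from the post; requests and BeautifulSoup are my assumptions about the tooling:

```python
# Rough sketch of the aggregator heuristic described in the post above.
import requests
from bs4 import BeautifulSoup
from urllib.parse import urljoin

def body_text(url: str):
    """Fetch a page and return (visible body text, parsed soup)."""
    soup = BeautifulSoup(requests.get(url, timeout=10).text, "html.parser")
    return (soup.body.get_text(strip=True) if soup.body else "", soup)

def grab_chapter(release_url: str):
    text, soup = body_text(release_url)
    if len(text) > 600:          # long body -> assume this IS the chapter
        return text
    for a in soup.find_all("a", href=True):   # otherwise follow links in order
        linked_text, _ = body_text(urljoin(release_url, a["href"]))
        if len(linked_text) > 600:
            return linked_text   # first long page wins, decoys included
    return None
```

This is why putting unedited links before the edited ones works: the bot takes the first page that passes the length check and never looks any further.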
Yes, that would probably work, but you'd be losing exposure from NU's main page. If you don't care about losing some traffic, it would most likely work. I've tried blocking the bots multiple times, but after blocking them they find a way back. It's an endless loop, and at the end of the day I felt it was better to spend my energy improving the site rather than fighting a losing battle. No, a captcha isn't against NU's policy. How exactly is NU "kinda" supporting aggregators? I'm not sure where you're getting your information from, but NU allows Google Docs.
Yes, that's why you use spider traps together with norobots. Legitimate search engines like Google will abide by your norobots rule and will not scrape your content. Aggregators won't honour it. Therefore, you put in spider traps so that their bots get stuck in an infinite loop of scraping content that doesn't matter. Not only are you stopping aggregator bots, you're giving a big middle finger to their faces. Of course, if they figure out how the spider trap works, they can modify the bot to avoid it, but until they do, they'll keep wasting resources without knowing why.
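To complement the robots.txt sketch earlier, here's roughly what the spider-trap half could look like. Flask and the /trap/ path are my assumptions for illustration; the point is that robots.txt disallows the trap, so only bots that ignore it ever get inside:

```python
# Minimal spider-trap sketch, assuming Flask; /trap/ is a hypothetical path
# that robots.txt would disallow, so well-behaved crawlers never enter it.
import random
from flask import Flask

app = Flask(__name__)

@app.route("/trap/<int:depth>")
def trap(depth: int):
    # Every trap page links to more trap pages, so a crawler that follows
    # all links never runs out of URLs to fetch here.
    links = "".join(
        f'<a href="/trap/{depth + random.randint(1, 1000)}">chapter</a>'
        for _ in range(5)
    )
    # Over 600 characters of junk, so it even passes the length heuristic
    # described earlier in the thread.
    filler = "lorem ipsum " * 100
    return f"<html><body><p>{filler}</p>{links}</body></html>"

if __name__ == "__main__":
    app.run()
```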
Instead of blocking, why not insert massive copyright notices to annoy the readers? (They're not affected when reading on your own website.) If I'm not wrong, Rebirth Online World did that...
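The post doesn't say how it was done, but one common way this trick can work is to scatter notices through the text in a CSS class that your own stylesheet hides; scrapers copying raw HTML don't copy your stylesheet, so the notices show up on their sites. A purely hypothetical sketch (the class name, notice text, and domain are all made up):

```python
# Hypothetical watermarking sketch; NOT a confirmed description of what
# Rebirth Online World actually did.
NOTICE = '<span class="cr">This chapter was stolen from example-translator.com</span>'

def watermark(paragraphs: list[str], every: int = 3) -> str:
    """Interleave a copyright notice after every few paragraphs."""
    out = []
    for i, p in enumerate(paragraphs):
        out.append(f"<p>{p}</p>")
        if i % every == every - 1:
            out.append(NOTICE)
    return "\n".join(out)

# On your own site, the stylesheet simply contains:  .cr { display: none; }
# so your readers never see the notices, while aggregator readers do.
```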
Thank you, my lord. The website, however, doesn't include DMCA'd manga like One Piece and many other super popular ones. I use it a lot, though. It's incredible how good the site became in the space of a year.