How to prevent content stealers?

Discussion in 'Tech Discussion' started by Vijaya, Apr 18, 2019.

  1. Vijaya

    Vijaya Well-Known Member

    Joined:
    Nov 14, 2018
    Messages:
    57
    Likes Received:
    31
    Do you know of any script or third-party app for Blogger/Blogspot that can lock content?

    Previously I posted chapters as images (JPG) to stop novel-reading sites from stealing our hard work.
    But after I found out that some sites also steal content posted as images, I lost hope in that approach.
    I could use fake updates/posts, but I'm too lazy to do it -_-
    Too many posts _( : 3」

    Any recommendations?
     
    Last edited: Apr 18, 2019
    lilysmith likes this.
  2. Warger

    Warger Well-Known Member

    Joined:
    May 15, 2016
    Messages:
    146
    Likes Received:
    228
  3. zivoy

    zivoy Well-Known Member

    Joined:
    Jun 19, 2017
    Messages:
    36
    Likes Received:
    59
    You can take a look at what the group "Hikki no Mori Translations" is doing to lock their content from aggregators. As far as I know, they have the most effective method.
    Specifically, they started implementing the protection on chapter 75 of "Akuyaku Reijo Ni Koi Wo Shite".
     
    Last edited: Apr 18, 2019
    Bright_Lucky_Star likes this.
  4. SoulZer0

    SoulZer0 Heaven Refining

    Joined:
    Oct 25, 2016
    Messages:
    12,478
    Likes Received:
    24,484
    Can't steal your content if you don't have any content.
    [Image: 09-roll-safe.w710.h473.2x.jpg]
     
    Wujigege, cap.toon, kenar and 4 others like this.
  5. vancenick

    vancenick Well-Known Member

    Joined:
    Jun 23, 2016
    Messages:
    380
    Likes Received:
    279
    Watermarks and non-standard formatting are always a classic.
     
  6. Jeebus

    Jeebus Well-Known Member

    Joined:
    Jun 20, 2017
    Messages:
    904
    Likes Received:
    780
    Anything that a person can read can be scraped, and the more effective a measure is at stopping aggregators from scraping your work, the harder it also makes things for legitimate readers. Worse yet, even the more extreme measures won't be enough to deter most aggregators.

    In fact, if you do too much to deter aggregators, and make your content too difficult to access for your legitimate readers, you'll end up driving them to aggregators instead.

    Just look at the big players like WW, Volare, and Qidian. They are all on aggregator sites. If they aren't able to stop it, do you have the resources to do so?

    My advice is to just ignore the aggregators. Let them be douches. Focus on the positives, like your passion for the work you're translating, and the community that's developing around it.
     
    Wujigege and runsing like this.
  7. Vijaya

    Vijaya Well-Known Member

    Joined:
    Nov 14, 2018
    Messages:
    57
    Likes Received:
    31
  8. Vijaya

    Vijaya Well-Known Member

    Joined:
    Nov 14, 2018
    Messages:
    57
    Likes Received:
    31
    Thanks for the recommendation
     
  9. Causal Reader

    Causal Reader Well-Known Member

    Joined:
    Jul 3, 2016
    Messages:
    305
    Likes Received:
    84
    There isn't any app or script that can prevent a determined content stealer from taking content once you post it on the web. Your current method is pretty solid, although the downside is that it discourages some visitors, because reading your content becomes less convenient. If you'd like to know how they bypass your protection, it's simple: first they take a snapshot of the page, then they run a text-scanning (OCR) app on it to recreate the content, or they simply use the latest scanning app to do the whole job.
     
  10. Jeebus

    Jeebus Well-Known Member

    Joined:
    Jun 20, 2017
    Messages:
    904
    Likes Received:
    780
    Most aggregators are automated. They use a headless browser to request (scrape) the web page. Then, they use OCR to convert it to text. Then, it's uploaded to the web server before being made available on the aggregator's site.
     
  11. Haylias

    Haylias Well-Known Member

    Joined:
    Oct 20, 2018
    Messages:
    162
    Likes Received:
    549
    Just DDOS [censored] and all the rest. Extreme problems require extreme solutions.
     
  12. Causal Reader

    Causal Reader Well-Known Member

    Joined:
    Jul 3, 2016
    Messages:
    305
    Likes Received:
    84
    Yes, that's right, it prevents casual content theft, but Vijaya is griping that her security measures weren't enough to stop one of the aggregators from stealing the content.
     
  13. rijimon17

    rijimon17 Hope you can read the words

    Joined:
    Aug 9, 2016
    Messages:
    545
    Likes Received:
    532
    There was this one translator site that coded their translations so that copy/pasted text shows up as a garbled mess. I can't remember the website or story, but an example would be:

    Website: and so li yo took a deep breath, went through the plan one more time in his head, and entered the gates.

    Copy/paste: fjxucbfj djdbux ddbdjic Dr8fu48odge wif84boven Davhx8did f djdifb Djr8rrJ icehf8d didochbdkd djdjffbdi didbfhficj dichfjcoeb did tvjdochfgud djdieh.

    Edit: I found this out after copy/pasting a word I didn't understand into Google only to find gibberish in the search bar
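
    One way to get this effect is a letter-substitution scheme: the page source stores scrambled letters, and a custom web font draws each scrambled letter with the glyph of the real one. Whether that site worked this way is an assumption, not something confirmed in this thread. The sketch below only models the text side of the idea. The names make_glyph_map, scramble, and unscramble, and the seed value, are hypothetical.

```python
import random
import string


def make_glyph_map(seed):
    """Build a random one-to-one lowercase letter substitution.

    Assumption: a custom font would render each stored letter with the
    glyph of the original letter, so readers see normal text on screen.
    """
    letters = string.ascii_lowercase
    shuffled = list(letters)
    random.Random(seed).shuffle(shuffled)
    return dict(zip(letters, shuffled))


def scramble(text, glyph_map):
    """Return the text as it would sit in the page source (and clipboard)."""
    return "".join(glyph_map.get(ch, ch) for ch in text)


def unscramble(text, glyph_map):
    """Invert the substitution, which is all a 'decrypter' needs to do."""
    reverse = {stored: real for real, stored in glyph_map.items()}
    return "".join(reverse.get(ch, ch) for ch in text)


glyph_map = make_glyph_map(seed=75)
visible = "and so li yo took a deep breath"
stored = scramble(visible, glyph_map)

print("On screen :", visible)
print("Copy/paste:", stored)
print("Recovered :", unscramble(stored, glyph_map))
```

    Note that the mapping has to ship with the page (inside the font), so anyone who extracts it can reverse the scramble, and OCR on a screenshot sidesteps it entirely. It stops casual copy/paste, not a determined scraper.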
     
  14. Jeebus

    Jeebus Well-Known Member

    Joined:
    Jun 20, 2017
    Messages:
    904
    Likes Received:
    780
    What I described is how the aggregators steal the content. It's stupid easy to make a scraper in just about any language you can imagine. All dynamic web pages are generated algorithmically. That being the case, all you have to do is traverse the DOM to your payload, which will be in the same place for every chapter. After that, you do formatting, like removing the watermarks that are in some releases, or running OCR if the payload text is in an image.

    That's why I'm saying that if I can access a webpage and read the content, so can a scraper. I was able to make a scraper for NU in an evening.
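
    To illustrate why a fixed page layout offers no protection, here is a minimal sketch of the "traverse the DOM to your payload" step using only Python's standard html.parser module. It runs on an inline sample page rather than any real site. The id "chapter-body", the sample HTML, and the names PayloadExtractor and extract_payload are assumptions made up for this example.

```python
from html.parser import HTMLParser


class PayloadExtractor(HTMLParser):
    """Collect the text found inside an element with the given id."""

    # Tags that never have a closing tag, so they must not affect nesting.
    VOID = {"br", "img", "hr", "input", "meta", "link"}

    def __init__(self, target_id):
        super().__init__()
        self.target_id = target_id
        self.depth = 0    # greater than 0 while inside the target element
        self.chunks = []  # text fragments collected so far

    def handle_starttag(self, tag, attrs):
        if tag in self.VOID:
            return
        if self.depth:
            self.depth += 1
        elif dict(attrs).get("id") == self.target_id:
            self.depth = 1

    def handle_endtag(self, tag):
        if tag in self.VOID:
            return
        if self.depth:
            self.depth -= 1

    def handle_data(self, data):
        if self.depth and data.strip():
            self.chunks.append(data.strip())


# Hypothetical page: the chapter text always sits in the same container.
SAMPLE_PAGE = """
<html><body>
  <div id="sidebar">Ads and links</div>
  <div id="chapter-body"><p>Li Yo took a deep breath.</p><p>He entered the gates.</p></div>
</body></html>
"""


def extract_payload(html, target_id="chapter-body"):
    """Return the text of the target element, one fragment per line."""
    parser = PayloadExtractor(target_id)
    parser.feed(html)
    return "\n".join(parser.chunks)


print(extract_payload(SAMPLE_PAGE))
```

    Because every chapter page is generated from the same template, the same few lines work on all of them, which is the point being made above.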
     
  15. Jeebus

    Jeebus Well-Known Member

    Joined:
    Jun 20, 2017
    Messages:
    904
    Likes Received:
    780
    You should probably edit your post before the mods do. Linking to aggregators, even if it's your own work, is frowned upon here.
     
    SoulZer0 likes this.
  16. SoulZer0

    SoulZer0 Heaven Refining

    Joined:
    Oct 25, 2016
    Messages:
    12,478
    Likes Received:
    24,484
    OCR too op
     
  17. positiveaddictchan

    positiveaddictchan Well-Known Member

    Joined:
    Nov 23, 2018
    Messages:
    26
    Likes Received:
    204
    So content stealers also copy the images?
    I'm just going to experiment and see what happens.
     
  18. SoulZer0

    SoulZer0 Heaven Refining

    Joined:
    Oct 25, 2016
    Messages:
    12,478
    Likes Received:
    24,484
    Like I said, they just run OCR on your images, then post the result on their site.
     
  19. positiveaddictchan

    positiveaddictchan Well-Known Member

    Joined:
    Nov 23, 2018
    Messages:
    26
    Likes Received:
    204
    Welp. Can't avoid them.
     
  20. TamaSaga

    TamaSaga Well-Known Member

    Joined:
    Oct 11, 2016
    Messages:
    1,726
    Likes Received:
    2,173
    Because of how annoying the ad wall was (wait 5 seconds, repeated 3 times, every time you hit the next-chapter button), I was seriously considering writing a decrypter for that site...