How-to solve SPAM and Democratize Steem: Introducing UserAuthority

jesta (71) 8 years ago

While I'm largely undecided on the details (how this could be implemented, if it should be, and specifically into what), I think there's a discussion worth having here.

Using the quality of a user (based off followers) to gauge their rewards/impact is a relatively new concept that I'm seeing used more and more in some of the 3rd party services. Based on how they're performing - it might be good to apply the concept to the overall Steem ecosystem or perhaps SMTs.

$2.17

6 votes

cnts (64) 8 years ago

Spot on, mate. This speaks volumes. Upvoted so it isn't missed! @cnts :]

$0.00

scipio (65) 8 years ago

Hi Jesta, via DMs, 2 people suggested to me that I should forward this via email to somebody called "sneak". Do you know if that's the same person as account @sneak ? Do you know this person "sneak", and could you perhaps ask him/her to have a look? Or do you think I should send the email?

$0.00

stellabelle (75) 8 years ago

@sneak is the CTO. And yeah, that's him

$0.09

oaldamster (68) 8 years ago

Yes, that'll be the one.

$0.00

stellabelle (75) 8 years ago

Hey, here's sneak's Twitter account in case you want to follow him: https://twitter.com/sneakdotberlin?lang=en

$0.00

jesta (71) 8 years ago

Yup, @sneak is one of the guys on the Steem team.

$0.00

boontjie (52) 8 years ago

Have you considered applying the SVD calculation to speed up computation and basing your calculation on the top 90% of the eigenvalues?

$0.00

scipio (65) 8 years ago

No, not yet, and I don't think it would matter that much. Comparing users to pages: Steemit has only a very small number of users (let's say 500k) compared to the number of pages on the web (a lot more). Computationally it doesn't currently seem to be a problem, so optimizing it would bring little added value. Also, the total follower graph doesn't change drastically from day to day: a lot of new relationships may be formed and some (a lot fewer) are terminated, yet all in all that doesn't make a big difference to the overall end results. So recalculating the UA binary index daily (containing about 8 bytes per user, or more if more information is stored in it) is fine.
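To put the "about 8 bytes per user" figure in perspective, here is a minimal sketch of such an index. The layout (one little-endian float64 score per user, stored in account order) and the names `RECORD`, `pack_index` and `read_score` are assumptions for illustration only; the thread does not describe the actual file format.

```python
import struct

# Assumption: one 8-byte little-endian float64 UA score per user,
# stored in account-id order. The real UA index layout is not specified.
RECORD = struct.Struct("<d")

def pack_index(scores):
    """Pack a list of UA scores into one flat bytes object."""
    return b"".join(RECORD.pack(score) for score in scores)

def read_score(index, user_id):
    """Read the score of the user at position user_id without unpacking the rest."""
    return RECORD.unpack_from(index, user_id * RECORD.size)[0]

index = pack_index([0.25, 0.5, 0.125])
print(len(index))             # 24 bytes for 3 users
print(read_score(index, 1))   # 0.5
print(500_000 * RECORD.size)  # 4000000 bytes, roughly 4 MB for 500k users
```

At that size, rewriting the whole index once a day is cheap, which fits the point made above.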

$0.00

stellabelle (75) 8 years ago

Yes it definitely would. It would solve a lot of problems.

$0.00

jasonbu (61) 8 years ago

This is great. I've seen so many 0.001 spam comments, not to my account (I only recently ascended to a level where they notice me) but to others. Seems like it would be a good way to ensure value to Steem, limit the garbage in the chain and add trust. Great explanation of the concept.

$1.40

12 votes

scipio (65) 8 years ago

I might write up another post on how to combine Keybase.io as an encrypted private chat layer built into the Steem blockchain! No more 0.001 SBD public memo notifications then! ;-)

$1.36

jasonbu (61) 8 years ago

That would be great. That would take the blockchain much closer to the traditional Social Media platforms or at least give it the ability to extend apps towards that goal. That would definitely add another dimension.

$1.39

scipio (65) 8 years ago

The Steem blockchain already allows for encrypted messages, yet a de facto private chat layer has not been implemented into Condenser yet (= the technical term for the Steemit front-end).

$0.00

othmane (39) 8 years ago

Thank you for this great article. I wish you success because the way to write a good article will get to the best. Thank you again

$0.00

rondellrandall (35) 8 years ago

OK, now I'm thoroughly confused, lol! But thanks for taking the time to write this blog.

$1.30

6 votes

scipio (65) 8 years ago

Still confused? You can ask questions if you want, I'm happy to respond!

$0.00

moorkedi (57) 8 years ago

woow owwo and woow ... definitely perfect !

$0.54

2 votes

lukestokes (75) 8 years ago

Very interesting. The SMT white paper talks about oracles for controlling who can access an SMT rewards pool and I wonder if algorithms like this might be more effective and/or help aid oracles in making decisions about accounts and the value they bring (or take away) from the network.

I'd love to see an open marketplace for various spam prevention algorithms that users could implement to improve their own experience. Maybe it could involve shared mute lists or something similar, along with regularly published reports of abusers taking from the reward pool but adding no value. Certain accounts like @steemcleaners (or something similar) could then go through and downvote them, and other busy whales could delegate some SP to help.

$0.50

5 votes

vimukthi (73) 8 years ago

May I add a simple suggestion for a bot? First of all, I'm not a programmer. Just a smart guy who loves Steemit.

You could use a bot to check for repeated comments by a user. Lots of spammers copy/paste their spam. There are also words like upvote, plz, follow used repeatedly. If a bot could be made to find users with a high amount of copy/paste content and common words used in upvote begging, it could generate a list for whales to review. Then those who are clearly spamming could be picked and put on a public list.
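A rough sketch of what such a check could look like, assuming the comments are already available as (author, body) pairs. The function names, the thresholds and the exact word list are hypothetical choices for illustration; this is not an existing bot, and it only produces a list for human review.

```python
import re
from collections import Counter

# Words named in the suggestion above; a real list would need tuning.
BEGGING_WORDS = {"upvote", "plz", "follow"}

def normalize(text):
    """Lowercase and strip punctuation so near-identical pastes compare equal."""
    return " ".join(re.findall(r"[a-z0-9]+", text.lower()))

def review_list(comments, min_repeats=3, min_begging=2):
    """Return authors worth a manual look.

    comments: iterable of (author, body) pairs.
    min_repeats / min_begging: hypothetical thresholds, not tuned values.
    """
    per_author = {}
    for author, body in comments:
        per_author.setdefault(author, []).append(normalize(body))

    flagged = []
    for author, bodies in per_author.items():
        # Highest number of times the same normalized comment was posted.
        most_repeated = max(Counter(bodies).values())
        # Number of comments containing at least one begging word.
        begging = sum(1 for b in bodies if BEGGING_WORDS & set(b.split()))
        if most_repeated >= min_repeats or begging >= min_begging:
            flagged.append(author)
    return sorted(flagged)

comments = [
    ("a", "Nice post! plz upvote"),
    ("a", "nice post, plz upvote!"),
    ("a", "Nice post plz upvote"),
    ("b", "Interesting point about follower graphs."),
]
print(review_list(comments))  # ['a']
```

Exact matching after normalization only catches literal copy/paste; spammers who vary their wording would need a fuzzier comparison.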

I'm very happy to see many people fighting the good fight. You guys give me hope regarding the future of steemit. My simple suggestion wouldn't go too far. Spammers will adapt. I actually came across serial spammers with reputation above 50. It's just nuts: https://steemit.com/steemit/@vimukthi/serial-spaming-your-way-into-a-reputation-of-56-how-did-this-happen

Wish you guys best of luck and hope my suggestion helped :-)
@vimukthi

$1.36

scipio (65) 8 years ago (edited)

Hi @vimukthi , your suggestion on how to algorithmically detect spam / repeated comments (= content analysis), could be another extension of my UserAuthority (UA) spam identification capabilities (= user authority / popularity analysis).

Your suggestion could also be implemented stand-alone via bi-directional hashing encryption, by calculating "character proximity". Lots of difficult words here ;-), but your solution on its own is not really needed. Compare this principle to copy-pasta webpages: it is impossible to stop authors from publishing such webpages; Google's only task is to prevent those pages from getting to the top of the search rankings (SERPs).

Seen from a Steem ecosystem perspective, it's merely important to stop those comments from receiving high-value author rewards. Hence the need for my UA algo to be implemented Steem-wide (See my HF22 proposal UA * SP at the bottom of my article.)

$0.00

torquewrench1969 (54) 8 years ago

Mathematically, it looks like your UA algo will work just fine :)

Some folks may not like it, but as with all algos there's always room for improvement!

BTW: here's a link to an article on "bot tells"
https://steemit.com/steemit/@torquewrench1969/tips-i-use-to-identify-bot-accounts

$0.00

scipio (65) 8 years ago

Spot on! ;-)

$0.00

torquewrench1969 (54) 8 years ago

Another post coming out about this project shortly!

$0.00

scipio (65) 8 years ago (edited)

An equivalent of this algorithm was used, in another form, as the only relevant algorithm when Google launched in 1998. In this form, it addresses the same kind of problem Google had in 1998: there, which page should be at the top of the results; here, which user is to be regarded as authoritative.

$1.30

paulag (73) 8 years ago

This is awesome. We need some sort of change like this. And what's even better is you explained it with my language, Excel! Steem on, I have upvoted and resteemed

$0.33

scipio (65) 8 years ago (edited)

Haha, well, I like Excel for explaining stuff and quickly setting up a POC-app-ish kind of thing....
And I like your work, as well as yourself, as well, Paula :-). Steem on!

$0.00

paulag (73) 8 years ago

I love when I make new friends, steemit is awesome for that anyway!

$0.00

scipio (65) 8 years ago

I feel the same! :-)

$0.00

stellabelle (75) 8 years ago

Likewise. It's good to see we are all on the same page about this.

$0.00

windrockswater (48) 8 years ago

This is fantastic! A tremendous idea and post, genuine and heartfelt feedback, from a diverse community, conversation and dialogue leading to meaningful changes! I have no idea what you're talking about, but I love every word!!!

$0.25

scipio (65) 8 years ago

This is also a cool comment! :-)
It's perfectly okay if you don't understand the math. But let me try explaining as simply as I can:

UA is a derived metric (all the data is in the blockchain already).
The math behind my UserAuthority (UA) algorithm calculates how "popular" an account is, by taking into account who follows that account. And that's done automatically for every user in the entire Steem system.
Not all follows are weighted equally: a follow counts for more when the follower has a high UserAuthority itself.
Think of randomly clicking through users' follower lists (who follows that user): an account's UA is the probability that such a random click ends up at said account via its followers.
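The idea in the list above can be sketched as a small PageRank-style power iteration over the follower graph. This is an illustration only, not the actual UA code: the damping factor of 0.85, the fixed iteration count, and the function name `user_authority` are assumptions borrowed from the textbook PageRank formulation.

```python
def user_authority(follows, damping=0.85, iterations=50):
    """PageRank-style score per account (illustrative, not the real UA code).

    follows: dict mapping a follower to the list of accounts it follows.
    damping, iterations: assumed values from the classic PageRank setup.
    """
    users = set(follows)
    for followed in follows.values():
        users.update(followed)
    users = sorted(users)
    n = len(users)

    ua = {u: 1.0 / n for u in users}
    for _ in range(iterations):
        # Every account gets a small baseline share ("random jump").
        nxt = {u: (1.0 - damping) / n for u in users}
        for u in users:
            targets = follows.get(u, [])
            if targets:
                # A follower passes its own authority on, split over
                # everyone it follows: follows are not weighted equally.
                share = damping * ua[u] / len(targets)
                for t in targets:
                    nxt[t] += share
            else:
                # An account that follows nobody spreads its weight evenly.
                share = damping * ua[u] / n
                for t in users:
                    nxt[t] += share
        ua = nxt
    return ua

follows = {"alice": ["carol"], "bob": ["carol"], "carol": ["alice"]}
scores = user_authority(follows)
for name in sorted(scores, key=scores.get, reverse=True):
    print(name, round(scores[name], 3))
```

In this toy graph carol ends up on top (two followers), alice second (followed by the high-scoring carol), and bob last (no followers), and the scores sum to 1 like a probability.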

Okidoki? Better? Better...
(Otherwise, eat a Snickers! ;-) )

$0.00

windrockswater (48) 8 years ago

Much better, I get it! I love Snickers by the way, eat 'em all the time. Very gracious of you to make this effort to explain, appreciate that as much as the explanation itself. Thank you.

$0.26

scipio (65) 8 years ago

Everybody needs to understand in order to be capable of "judging" what I propose!
I gradually updated my article with more explanations day by day. At first, people seemed afraid to comment (maybe afraid of looking like an idiot?). But the smartest people around are the people asking questions by first admitting they didn't understand.

I get confused with cooking myself! Although my baked eggs are nice ... :-)

$0.00

stellabelle (75) 8 years ago

Let's do this. I had time to read this several times, I see how it needs to be done.

$0.18

scipio (65) 8 years ago (edited)

Cool! :-)

PS: please just upvote yourself to the top for visibility, I can't add more than $0.10 myself on your comment! :P
PS2: Jesta's comment (had a long talk with Jesta yesterday) is in here as well, same thing: I don't have the power to get his comment to the top.

$0.00

stellabelle (75) 8 years ago

Looks like you're my boss now?

$0.11

scipio (65) 8 years ago

Whahaha! Please, no! I've been "boss" at work since 1999, and have tried to motivate people to take charge and show initiatives. Yet people keep on asking me for approval, no matter what I do. I kind of like being the new kid in town here!

$0.00

steemulator (51) 8 years ago

New users, myself included, who have little/no userauthority, will be equated to the bots - is my understanding of your proposal correct?

$0.16

scipio (65) 8 years ago (edited)

Good question! Short answer: no, you're not seen as a bot, at all!
Longer answer: the proposed mechanism is extendable with all sorts of metrics, that combined identify bots / spammers extremely accurately, distinguished from new users, like you, and more prominently: me! (my own account is brand new!).

Examples:

very low UA, very high amount of posts (that's a sign, right? lots of messages but nobody follows..)
very low UA, very high rewards (that's even stranger!)
very low UA, yet it has lots of followers (spooky....!)
very high UA, zero posts (hmmm, that seems like a passive whale account...)
etc.
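The example signals above could be combined into simple rule checks, roughly like this. All thresholds and names (`AccountStats`, `LOW_UA`, `signals`, and so on) are hypothetical placeholders; sensible cut-offs would have to come from real chain data.

```python
from dataclasses import dataclass

@dataclass
class AccountStats:
    name: str
    ua: float        # UserAuthority score
    posts: int       # posts and comments written
    rewards: float   # total author rewards
    followers: int

# Hypothetical thresholds, for illustration only.
LOW_UA = 0.001
HIGH_UA = 0.1
MANY_POSTS = 500
HIGH_REWARDS = 1000.0
MANY_FOLLOWERS = 1000

def signals(account):
    """Return the list of suspicious patterns an account matches."""
    found = []
    if account.ua < LOW_UA and account.posts > MANY_POSTS:
        found.append("very low UA, very high amount of posts")
    if account.ua < LOW_UA and account.rewards > HIGH_REWARDS:
        found.append("very low UA, very high rewards")
    if account.ua < LOW_UA and account.followers > MANY_FOLLOWERS:
        found.append("very low UA, lots of followers")
    if account.ua > HIGH_UA and account.posts == 0:
        found.append("very high UA, zero posts")
    return found

new_user = AccountStats("newbie", ua=0.0005, posts=3, rewards=0.5, followers=2)
spammer = AccountStats("spammy", ua=0.0005, posts=5000, rewards=10.0, followers=4)
print(signals(new_user))  # []
print(signals(spammer))   # ['very low UA, very high amount of posts']
```

A new account with low UA but ordinary activity matches none of the rules, which is the distinction drawn in the answer above.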

PS: in case an account has zero followers (new accounts), they are not found on the total link graph. Yet they are easily includable via a "NewUsers bot", detecting accounts with zero followers, and following them for inclusion within the follower graph.

$0.03

steemulator (51) 8 years ago

Thank you for explaining this! I know I am not a bot, let's hope Steemit will also know it : D

$0.03

scipio (65) 8 years ago

Steem then first needs to implement my code via HF21 ;-) But Utopian-IO probably will do so before that!

$1.36

9 votes

pastbastard (53) 8 years ago

It's actually fairly simple and really elegant. Not sure if it was your explanation or the concept itself, but I like it. Thanx for the hard work, Steem and Steemit need it!

$0.13

scipio (65) 8 years ago (edited)

Thx! When first publishing the article I just wanted to help Utopian improve its bot algorithm (then v1, now v2, hoping to implement UA in its v3 in the coming days). Yet only after publishing it, a few hours later, I realized the very same algo can "sanitize" Steem.

I encourage everybody looking at this page to openly express their doubts, questions, etc.

$0.00