In the similar questions shown when asking, is it possible to give more weight to a word when the question title is shorter?

Question

1 Answer

gidgreen · Answer 1 · 2011-06-03T02:57:12+0000

commented Mar 3, 2017 by Mélanie

File: qa-db-selects.php

function qa_db_related_qs_selectspec($voteuserid, $questionid, $count=QA_DB_RETRIEVE_QS_AS)
/*
Return the selectspec to retrieve the $count most closely related questions to $questionid,
with the corresponding vote made by $voteuserid (if not null). This works by looking for other
questions which have title words, tag words or an (exact) category in common.
*/
{
$selectspec=qa_db_posts_basic_selectspec($voteuserid);

$selectspec['columns'][]='score';

// added LOG(postid)/1000000 here to ensure ordering is deterministic even if several posts have same score

$selectspec['source'].=" JOIN (SELECT postid, SUM(score)+LOG(postid)/1000000 AS score FROM ((SELECT ^titlewords.postid, LOG(#/titlecount) AS score FROM ^titlewords JOIN ^words ON ^titlewords.wordid=^words.wordid JOIN ^titlewords AS source ON ^titlewords.wordid=source.wordid WHERE source.postid=# AND titlecount<#) UNION ALL (SELECT ^posttags.postid, 2*LOG(#/tagcount) AS score FROM ^posttags JOIN ^words ON ^posttags.wordid=^words.wordid JOIN ^posttags AS source ON ^posttags.wordid=source.wordid WHERE source.postid=# AND tagcount<#) UNION ALL (SELECT ^posts.postid, LOG(#/^categories.qcount) FROM ^posts JOIN ^categories ON ^posts.categoryid=^categories.categoryid AND ^posts.type='Q' WHERE ^categories.categoryid=(SELECT categoryid FROM ^posts WHERE postid=#) AND ^categories.qcount<#)) x GROUP BY postid ORDER BY score DESC LIMIT #) y ON ^posts.postid=y.postid";

array_push($selectspec['arguments'], QA_IGNORED_WORDS_FREQ, $questionid, QA_IGNORED_WORDS_FREQ, QA_IGNORED_WORDS_FREQ,
$questionid, QA_IGNORED_WORDS_FREQ, QA_IGNORED_WORDS_FREQ, $questionid, QA_IGNORED_WORDS_FREQ, $count);

$selectspec['sortdesc']='score';

return $selectspec;
}
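
To make the scoring in the query above easier to follow, here is a minimal PHP sketch of the same arithmetic outside SQL. It assumes a threshold of 10000 in place of QA_IGNORED_WORDS_FREQ (check the constant in your own install), and the function name related_score_sketch and the sample counts are hypothetical, not part of Question2Answer.

```php
<?php
// Assumed value for illustration only; the real constant is QA_IGNORED_WORDS_FREQ.
const IGNORED_WORDS_FREQ_SKETCH = 10000;

/**
 * Hypothetical helper mirroring the arithmetic of the SQL for one candidate post.
 * $titleCounts:    titlecount of each title word shared with the source question
 * $tagCounts:      tagcount of each tag shared with the source question
 * $categoryQCount: qcount of the shared category, or null if the category differs
 */
function related_score_sketch(array $titleCounts, array $tagCounts, ?int $categoryQCount, int $postId): float
{
    $score = 0.0;
    foreach ($titleCounts as $count) {
        if ($count < IGNORED_WORDS_FREQ_SKETCH) {          // titlecount<#
            $score += log(IGNORED_WORDS_FREQ_SKETCH / $count);
        }
    }
    foreach ($tagCounts as $count) {
        if ($count < IGNORED_WORDS_FREQ_SKETCH) {          // tagcount<#
            $score += 2 * log(IGNORED_WORDS_FREQ_SKETCH / $count);
        }
    }
    if ($categoryQCount !== null && $categoryQCount < IGNORED_WORDS_FREQ_SKETCH) {
        $score += log(IGNORED_WORDS_FREQ_SKETCH / $categoryQCount);
    }
    return $score + log($postId) / 1000000;                // deterministic tie-break
}

// A rare shared title word (in 10 titles) outweighs a common one (in 1000 titles).
$rare = related_score_sketch([10], [], null, 1);
$common = related_score_sketch([1000], [], null, 1);
printf("%.3f %.3f\n", $rare, $common);                     // prints "6.908 2.303"
```

Note that nothing in this score depends on how long either title is: every shared word contributes the same amount whether the title has three words or thirty.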

commented Mar 3, 2017 by Nip351
edited Mar 6, 2017 by Nip351

I'm wondering whether I could boost the score (by +0.5, say) by also taking the length of the question title into account.

To see this via SQL, you can use the following queries.

This shows the title of each post (question) and its character count, including spaces:
SELECT title, CHAR_LENGTH(title) FROM `qa_posts` WHERE title IS NOT NULL;

This shows each title with its spaces removed, along with the character count excluding spaces:
SELECT REPLACE(title,' ','') AS title, CHAR_LENGTH(REPLACE(title,' ','')) AS len_Count FROM `qa_posts` WHERE title IS NOT NULL;

So the issue now is that, counting letters only:
Hippopotamus, Eye, Toe will return 18 (for three words)
Toe, Eye, Hand, Nail, Neck will return 18 (for five words)

Overall this may be a slight improvement at best, but it is not 100% reliable, because two titles can have the same character count while containing different keywords. For example, if I searched for Eye, both of these titles would be returned. It would only work as intended if I searched for Hippo, which matches just the first one.
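
The collision described above can be reproduced in PHP with the two example titles written without the commas. This is only an illustrative sketch: the helper name title_stats is hypothetical, and strlen counts bytes, so it matches the character count only for ASCII titles like these.

```php
<?php
// Hypothetical helper: space-stripped length and word count of a title.
function title_stats(string $title): array
{
    $words = preg_split('/\s+/', trim($title), -1, PREG_SPLIT_NO_EMPTY);
    return [
        'length_no_spaces' => strlen(str_replace(' ', '', $title)), // bytes; fine for ASCII
        'word_count'       => count($words),
    ];
}

$a = title_stats('Hippopotamus Eye Toe');
$b = title_stats('Toe Eye Hand Nail Neck');

// Same stripped length, different number of words.
printf(
    "%d/%d %d/%d\n",
    $a['length_no_spaces'], $a['word_count'],
    $b['length_no_spaces'], $b['word_count']
); // prints "18/3 18/5"
```

The word count tells the two titles apart where the character count does not, so it may be the more useful measure of how "short" a title is.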

Now the trick is somehow working this count into the scoring of the related-question results.
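
One way the count might feed into the scoring is sketched below. This is not how Question2Answer works and not a tested patch: the function length_weighted_score, the 0.5 boost, and the choice to divide the boost by the number of title words are all assumptions made for illustration.

```php
<?php
/**
 * Hypothetical weighting: scale a matched word's base score so that the same
 * word counts for more in a title with fewer words.
 */
function length_weighted_score(float $baseScore, int $titleWordCount, float $boost = 0.5): float
{
    $words = max(1, $titleWordCount);          // guard against empty titles
    return $baseScore * (1 + $boost / $words);
}

$base  = log(10000 / 10);                      // base score of one shared word (assumed counts)
$short = length_weighted_score($base, 3);      // word matched in a three-word title
$long  = length_weighted_score($base, 5);      // same word matched in a five-word title
var_dump($short > $long);                      // bool(true)
```

Carrying this over to the SQL would need a per-post word count for each candidate title, which the query as posted does not select.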

I just realized this question was asked in 2011. :-(
