From 2a5b735c04733d34dd889c3a9f10de3aa2059a8a Mon Sep 17 00:00:00 2001 From: riseandshine0 Date: Wed, 21 Aug 2019 14:57:31 +0200 Subject: [PATCH] new .gitignore New Folder Idea to discuss approach --- .gitignore | 134 +++++++++++++++++++++++++++++++++++++++++++ idea/000-proposal.md | 61 ++++++++++++++++++++ idea/1.Headline.md | 7 +++ 3 files changed, 202 insertions(+) create mode 100644 .gitignore create mode 100644 idea/000-proposal.md create mode 100644 idea/1.Headline.md diff --git a/.gitignore b/.gitignore new file mode 100644 index 0000000..a1c7e39 --- /dev/null +++ b/.gitignore @@ -0,0 +1,134 @@ +# Created by .ignore support plugin (hsz.mobi) +### Python template +# Byte-compiled / optimized / DLL files +__pycache__/ +*.py[cod] +*$py.class + +# C extensions +*.so + +# Distribution / packaging +.Python +build/ +develop-eggs/ +dist/ +downloads/ +eggs/ +.eggs/ +lib/ +lib64/ +parts/ +sdist/ +var/ +wheels/ +pip-wheel-metadata/ +share/python-wheels/ +*.egg-info/ +.installed.cfg +*.egg +MANIFEST + +# PyInstaller +# Usually these files are written by a python script from a template +# before PyInstaller builds the exe, so as to inject date/other infos into it. +*.manifest +*.spec + +# Installer logs +pip-log.txt +pip-delete-this-directory.txt + +# Unit test / coverage reports +htmlcov/ +.tox/ +.nox/ +.coverage +.coverage.* +.cache +nosetests.xml +coverage.xml +*.cover +.hypothesis/ +.pytest_cache/ + +# Translations +*.mo +*.pot + +# Django stuff: +*.log +local_settings.py +db.sqlite3 +db.sqlite3-journal + +# Flask stuff: +instance/ +.webassets-cache + +# Scrapy stuff: +.scrapy + +# Sphinx documentation +docs/_build/ + +# PyBuilder +target/ + +# Jupyter Notebook +.ipynb_checkpoints + +# IPython +profile_default/ +ipython_config.py + +# pyenv +.python-version + +# pipenv +# According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control. +# However, in case of collaboration, if having platform-specific dependencies or dependencies +# having no cross-platform support, pipenv may install dependencies that don't work, or not +# install all needed dependencies. +#Pipfile.lock + +# celery beat schedule file +celerybeat-schedule + +# SageMath parsed files +*.sage.py + +# Environments +.env +.venv +env/ +venv/ +ENV/ +env.bak/ +venv.bak/ + +# Spyder project settings +.spyderproject +.spyproject + +# Rope project settings +.ropeproject + +# mkdocs documentation +/site + +# mypy +.mypy_cache/ +.dmypy.json +dmypy.json + +# Pyre particletype checker +.pyre/ +/.idea/ +.idea/.gitignore +.idea/Chess.iml +.idea/inspectionProfiles/ +.idea/misc.xml +.idea/modules.xml +.idea/sonarlint/ +.idea/vcs.xml \ No newline at end of file diff --git a/idea/000-proposal.md b/idea/000-proposal.md new file mode 100644 index 0000000..14c72f0 --- /dev/null +++ b/idea/000-proposal.md @@ -0,0 +1,61 @@ +#Proposing Algorithm + +#####1) The critical thinking model applied on humans can also be applied to a program in order to write an algorithm that detects a fake news. The program can be written in several parts ensuring that each module carry out only a single step from the steps below. + +#####Critical Thinking Model: + +1. Read the headline. +2. Read the entire article. +3. Don’t believe a word of anything you read until you check facts and check sources. +4. Are the sources and facts credible? Why or why not? +5. Do a quick search engine scan to see who else has covered the story. +6. Do you see two sides (or more) to the article? +7. Are you being spun? Do you feel manipulated? +8. Are other credible news outlets covering the story? +9. Is this story a potential fake news story? + + +###Implementation + + +####Read the headline +The headline will provide the program a rough idea. It may be designed in a way that the headline will be reverse-searched on top search engines and gather all the data from similar headlines into heap. The program will also look up for the data on the source website to estimate the legitness_score of that source. + + + +####Read the entire article +The next steps involves scanning through the whole article word by word and finding relevant patterns that may be crucial to further classify the article into fake or legit. Further the motive of the article may be compared with the headline to predict weather the misleading_title returns True or False + + + +####Don’t believe a word of anything you read until you check facts and check sources +The initial overall trust_score of the article always always remains -1 until all the scores are calculated i.e The program will always consider the news to be fake unless it had completely processed it, hence not giving any preference to BBC.com over FakeNews.com and both considered a fake initially + + + +####Are the sources and facts credible? Why or why not? +The source of the current article, the author and the images on the article are reverse-searched to ensure the credibility of the source. the history of posts from the same author and images uploaded on the article are original or just carried forward from other sources and articles + + + +####Do a quick search engine scan to see who else has covered the story. * + + +####Do you see two sides (or more) to the article? +This step may involve checking if the article is comparing one entity with another example, political parties. The job of the program here is to determine what is being talked about here and what is it compared with eg: An article constantly comparing Males and Females + + + +####Are you being spun? Do you feel manipulated? +The next part will help determine if the article is biased towards one side more than the other, in the above example if the article is about Males and Females, the program checks if there's any bias to the comparison, One being favoured more over other and calculate the bias_score . When in favour of females the bias_score for females will be shown as +1 and -1 for men. unbias will be reflected with a bias_score totalling to 0 + + + +####Are other credible news outlets covering the story? + + +####Is this story a potential fake news story? +Finally after everything is taken into consideration, The parameters will be used to label the data to be a fake or a legit + + + diff --git a/idea/1.Headline.md b/idea/1.Headline.md new file mode 100644 index 0000000..a0c6171 --- /dev/null +++ b/idea/1.Headline.md @@ -0,0 +1,7 @@ +#1. Headline + +###Ideas for executing a headline rating + +####1.1 Trigger word list +Check the words contained in the headline against a pre determined list of words. +When reaching a set score the headline will be rated good or bad. \ No newline at end of file