I searched other issues (including closed issues) and could not find any to be related. If you find related issues post them below or directly add your issue to the most related one.
Describe the bug
I have been trying to use the article.date_modify attribute to get the modified date and time of articles from different newspaper websites.
It is always None, even though the site shows a modified date. This is the case for every article URL I tried.
To Reproduce
!pip3 install news-please #ran this on Google Colab
from newsplease import NewsPlease
url1 = 'https://www.thequint.com/news/law/supreme-court-article-370-jammu-and-kashmir-reorganisation-cases-hearing-govt-affidavit-rejoinder'
article = NewsPlease.from_url(url1)
print(article.date_modify)
# prints None
Expected behavior
I expected the code to return a datetime for when the article was last modified, in this case 2019-11-14 19:40:00
Log
Nothing to add here. I just tried the code as shown in the To Reproduce section.
Versions (please complete the following information):
Google Colab
Python Version 3.6.9
news-please Version 1.5.3
Intent (optional; we'll use this info to prioritize upcoming tasks to work on)
personal
academic
business
other
Some information on your project: Extracting modified date from newspaper articles
Hi! I got confused while exploring the main/core code, so my solution to this problem was to create a new pipeline dedicated to overriding the default date_modify. I use the same concept as DateExtractor, but I look for dateModified in the application/ld+json tag.
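For anyone who wants to try the same idea, here is a minimal sketch of looking up dateModified in application/ld+json tags. It is not the pipeline mentioned above and not part of news-please: the names extract_date_modified, _JsonLdCollector and _find_key are hypothetical, the sample HTML is made up, and it assumes Python 3.7+ (for datetime.fromisoformat) and an ISO 8601 timestamp.

```python
import json
from datetime import datetime
from html.parser import HTMLParser


class _JsonLdCollector(HTMLParser):
    """Collects the raw text of every <script type="application/ld+json"> tag."""

    def __init__(self):
        super().__init__()
        self._in_jsonld = False
        self._buffer = []
        self.blocks = []

    def handle_starttag(self, tag, attrs):
        script_type = (dict(attrs).get("type") or "").lower()
        if tag == "script" and script_type == "application/ld+json":
            self._in_jsonld = True
            self._buffer = []

    def handle_data(self, data):
        if self._in_jsonld:
            self._buffer.append(data)

    def handle_endtag(self, tag):
        if tag == "script" and self._in_jsonld:
            self.blocks.append("".join(self._buffer))
            self._in_jsonld = False


def _find_key(node, key):
    """Depth-first search for the first non-None value stored under `key`."""
    if isinstance(node, dict):
        if node.get(key) is not None:
            return node[key]
        children = list(node.values())
    elif isinstance(node, list):
        children = node
    else:
        return None
    for child in children:
        found = _find_key(child, key)
        if found is not None:
            return found
    return None


def extract_date_modified(html):
    """Return dateModified from the page's JSON-LD as a datetime, or None."""
    collector = _JsonLdCollector()
    collector.feed(html)
    for block in collector.blocks:
        try:
            data = json.loads(block)
        except json.JSONDecodeError:
            continue  # skip malformed JSON-LD instead of failing the article
        value = _find_key(data, "dateModified")
        if isinstance(value, str):
            try:
                # fromisoformat() before Python 3.11 does not accept a trailing "Z"
                return datetime.fromisoformat(value.replace("Z", "+00:00"))
            except ValueError:
                continue  # not an ISO 8601 timestamp
    return None


# Made-up sample page, only to show the call.
sample_html = """
<html><head>
<script type="application/ld+json">
{"@type": "NewsArticle", "dateModified": "2019-11-14T19:40:00+05:30"}
</script>
</head><body></body></html>
"""
print(extract_date_modified(sample_html))
```

The result could then be assigned to the article's date_modify in a custom pipeline step. Pages that only expose the modified date through meta tags or visible text would need a different lookup.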