The website above only shows partial content. Have to click on the title again for whole content but i want it to automatically all content being crawl and being shown in CSV)

computer science

Description

What to do?


- Crawl a whole website using python programming. I use sublime.text

  The website: http://www.police.gov.bn/Lists/News/AllItems.aspx


- Overall, I need it also for the CSV file (excel) 

  Title

  Date 

  Url

  The whole content of the title. 

         (The website above only shows partial content. Have to click on the title again for whole content

          but i want it to automatically all content being crawl and being shown in CSV)


  For example: 

  Title : KAWAL KESELAMATAN KAMPUNG MELALUI PENGAWASAN KEJIRANAN (this is in Malay Language, you can view on the latest website as it is the same)

  Date: Jumaat, 13 Sept

  Url: http://www.police.gov.bn/Lists/News/ItemDisplayForm.aspx?ID=1581&ContentTypeId=0x01004EE1520DB09AC547981D92B83F9B1E48  (Have to click on the title for whole story)

  The whole content: Start from 'Keutuhan hubungan bermasyarakat sesama berjiran mewujudkan... (First paragraph)

                     End until ' .... di kampung mereka bebas dari jenayah. " (Last paragraph)


- This crawl should be keep on updating like it retrieve latest news (loop) 



- After above done, if there's a way for Translator API. 

  To translate from Malay Language to English Language. 

  Since this website is in Malay.                     

     


Related Questions in computer science category