European Youth Portal
>>> block.a
<a href="/youth/volunteering/organisation/48592_en" target="_blank">"Academy for Peace and Development" Union</a>
>>> block.a.text
'"Academy for Peace and Development" Union'
>>> block.select_one('div > div > p:nth-child(9)')
<p><strong>PIC:</strong> 948417016</p>
>>> block.select_one('div > div > p:nth-child(9)').text
'PIC: 948417016'
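The session above pulls the name, link and PIC out of a single card. A minimal sketch of doing the same for every card on the page might look like the following. It assumes the cards are present in the HTML that gets fetched and that each one is a div with the class eyp-card, as in the snippet from the question. SAMPLE_HTML, parse_card and parse_cards are hypothetical names made up for this illustration, and the sample markup is only modelled on the output shown above, so the real page may be structured differently. The PIC is located through its label rather than through p:nth-child(9), since a position-based selector tends to break when a card has fewer paragraphs.

```python
from bs4 import BeautifulSoup

# Hypothetical sample markup modelled on the card shown in the session above;
# the real page structure may differ.
SAMPLE_HTML = """
<div class="eyp-card block-is-flex">
  <div>
    <h4><a href="/youth/volunteering/organisation/48592_en" target="_blank">"Academy for Peace and Development" Union</a></h4>
    <div>
      <p><strong>PIC:</strong> 948417016</p>
    </div>
  </div>
</div>
"""


def parse_card(block):
    """Return name, link and PIC for one card (a bs4 Tag)."""
    link = block.find("a")
    pic = None
    # Find the <strong>PIC:</strong> label instead of counting child positions.
    for strong in block.find_all("strong"):
        if strong.get_text(strip=True).rstrip(":") == "PIC":
            # Take the text of the surrounding <p> and keep what follows the first colon.
            pic = strong.parent.get_text(strip=True).split(":", 1)[1].strip()
            break
    return {
        "name": link.get_text(strip=True) if link else None,
        "url": link.get("href") if link else None,
        "pic": pic,
    }


def parse_cards(html, limit=20):
    """Parse up to `limit` cards from an HTML string."""
    soup = BeautifulSoup(html, "html.parser")
    blocks = soup.find_all("div", class_="eyp-card", limit=limit)
    return [parse_card(block) for block in blocks]


records = parse_cards(SAMPLE_HTML)
print(records)
```

To try it on the live page, response.text from requests.get(url) could be passed to parse_cards in place of SAMPLE_HTML. If that returns an empty list, the cards may be loaded by JavaScript after the initial request, which this sketch does not handle.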
Question
tarifa
Hello dear experts and friends on Neowin,
I am fairly new to simple_html_dom usage and methods. I know the parser a little.
I want to gather some information from this site:
https://europa.eu/youth/volunteering/organisations_en#open
Is it possible to get the content of, let us say, the last 10 or 20 records on that page and subsequently store it in my MySQL database?
I have the above code and I'm trying to get certain elements of the page but it isn't returning anything.
Is it possible that certain PHP functions might be disabled on the server to stop that?
The above code works perfectly on other sites.
Is there any workaround?
By the way, I have created a small snippet as a proof of concept to run this with Python and BeautifulSoup:
import requests
from bs4 import BeautifulSoup

url = 'https://europa.eu/youth/volunteering/organisations_en#open'
response = requests.get(url)
soup = BeautifulSoup(response.content, 'lxml')
print(soup.find('title').text)
block = soup.find('div', class_="eyp-card block-is-flex")

and this....
What I am aiming for in the end: I want to gather the first 20 results of the page and put them into a SQL database, or alternatively show the information in a little widget.
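For the storage half of that goal, here is a minimal sketch using Python's built-in sqlite3 module. It assumes each scraped result has already been reduced to a dict with the keys pic, name and url. The table name organisations, the function save_records and the column layout are hypothetical choices for this illustration, and the sample row reuses the one organisation shown in the session output. A MySQL database would need a separate driver and connection setup, which is not shown here.

```python
import sqlite3


def save_records(conn, records):
    """Create the table if needed and insert or update one row per record."""
    conn.execute(
        """CREATE TABLE IF NOT EXISTS organisations (
               pic  TEXT PRIMARY KEY,
               name TEXT NOT NULL,
               url  TEXT
           )"""
    )
    # Named placeholders let each dict be passed straight through.
    # INSERT OR REPLACE keeps one row per PIC when the scrape is repeated.
    conn.executemany(
        "INSERT OR REPLACE INTO organisations (pic, name, url) "
        "VALUES (:pic, :name, :url)",
        records,
    )
    conn.commit()


# An in-memory database keeps the example self-contained;
# a file path such as "organisations.db" would persist the data.
conn = sqlite3.connect(":memory:")
sample = [
    {
        "pic": "948417016",
        "name": '"Academy for Peace and Development" Union',
        "url": "/youth/volunteering/organisation/48592_en",
    }
]
save_records(conn, sample)
save_records(conn, sample)  # saving the same record again does not add a second row

rows = conn.execute("SELECT pic, name, url FROM organisations").fetchall()
print(rows)
```

Using the PIC as the primary key is a design assumption: it looks like a unique identifier per organisation, but if some cards lack a PIC, the link would be a safer key.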
9 answers to this question