You will need more memory to read faster. Your local computer has limited RAM but if you are doing it on server it will be faster because server has more RAM
I tried using the count method in pandas, I keep getting this error; AttributeError: 'Cursor' object has no attribute 'count' This is the code; dw_db = client.doctorwho tc = dw_db.test_collection tc.insert_one({'name': 'William', 'birthyear': 1908}) # Insert a few (zip takes some lists and returns a list of tuples) for n, b in zip('Patrick Jon Tom Peter Colin Sylvester Paul Christopher David Matt Peter'.split(), [1920, 1919, 1934, 1951, 1943, 1943, 1959, 1964, 1971, 1982, 1958]): tc.insert_one({'name': n, 'birthyear': b}) dw_db.tc.find({'name': 'Peter'}).count()
clear audio clear cut knowledge transfer.Thanks
You're welcome
Bro, you're helping me so much , thanks and new suscriber
H! Thank you.
Great video, from my favorite teacher
Thank you Bruna
it was so so sooooo helpful. thank you very very much
I'm so glad!
is there any way to iterate over document in collection and save it as dataframe
Yes, you can. Please check the official documentation for more details.
@@StatsWire I didn't find it in document can you help me with the link
@@abdulazizshaikh2255 I will share with you shortly
Great video. Very helpful showing the pandas dataframe stuff
Thank you!
Is there a convenient way creating pyspark dataframe instead of pandas??
Pandas is the easiest way I would say
@@StatsWire yeah, I know but to maintain the consistency, I have to do it with pyspark Or glue dynamic frame 🤦♂️🤷♂️
Thanks for the video .. i need to compare the Source data in mongoDb and Target (Snowflake ) after ETL i.e is loading into Datawarehouse.
You're welcome!
if i want to laod large data from mongodb to pyspark how to speed up the process very fasily of readig data and writimg data
You will need more memory to read faster. Your local computer has limited RAM but if you are doing it on server it will be faster because server has more RAM
Collection saved in mongo Db is saved on your system or on server ?
This mongodb is in my system i.e local but if your mongdb is configured on the server then it will be saved on the server.
How can I write code to show List of Outer Most field from that text documents???
Sorry, I am not getting your point.
am trying in colab its error
FileNotFoundError: [Errno 2] No such file or directory: 'C:\\Users\\Ravi\\OneDrive\\Desktop\\streamlit\\jagan.csv'
You have to give proper path
Nice 👍
Thank you
How can I display MongoDB collection in an HTML using ajax dataTable?
Hi, Do you find any method
Please refer to the below link on stack overflow
stackoverflow.com/questions/67730969/need-assistance-with-mongodb-python-html-and-ajax
Please refer to the below link on stack overflow
stackoverflow.com/questions/67730969/need-assistance-with-mongodb-python-html-and-ajax
@@StatsWire This is actually my post on Stack.
@@StatsWire This is actually my post on Stack. But now I need a way to display multiple collections.
Thanks for this informative video can you please CREATE VIDEO ON how can you fetch data batch by batch, instead of fetching all the data using find
You're welcome. Thanks for the suggestion. I can make a video on this soon maybe next week because this week I am occupied.
@@StatsWire Did you make this video? please share the url?
Se merece el like, era algo sencillo
Thank you
Thanks bro
You're welcome
thanks!
You're welcome!
nice
Thank you
nice bell.. too much volume
Haha sorry
I tried using the count method in pandas, I keep getting this error;
AttributeError: 'Cursor' object has no attribute 'count'
This is the code;
dw_db = client.doctorwho
tc = dw_db.test_collection
tc.insert_one({'name': 'William', 'birthyear': 1908})
# Insert a few (zip takes some lists and returns a list of tuples)
for n, b in zip('Patrick Jon Tom Peter Colin Sylvester Paul Christopher David Matt Peter'.split(),
[1920, 1919, 1934, 1951, 1943, 1943, 1959, 1964, 1971, 1982, 1958]):
tc.insert_one({'name': n, 'birthyear': b})
dw_db.tc.find({'name': 'Peter'}).count()
Hello, I think count method is deprecated, use the new method count_documents and let me if you still facing issue
@@StatsWire I used it and still had the same issue