Features in common | SQL Server | SharePoint |
Container | Database | Site |
Operations | Insert,Update,Delete,Query | Insert,Update,Delete,Query |
Metadata | Columns | Columns |
Data volume | Billions of records | Millions of records |
Event trigger | Yes | Yes |
Query language | SQL | CAML |
Apart from the exciting SharePoint features, such as versioning, soon I noticed two differences.
- All SPListItems are stored in one SQL table;
- Compare to SQL Server, "Insert,Update,Delete" operations of SPListItem are extremely slow;
Definitely NOT!
This is the conclusion I got after seven years of development and administration on SharePoint platform. Believe it or not, I feel that SPListItem/SPFile is actually Message, and SPList/SPContentType is Queue.
Let me show an example here.
If there are millions of items in a SPList, and we need to update the "Title" field of all data based on some rules, how should we do it? We cannot update data in batch, instead, we have to do the update one by one. This is how we process messages, not records.
Once we understand this point, then it's easy to understand why all SPListItems are stored in one SQL table, why the modification is so slow, and how we can utilize SharePoint in better way.
What do you think? Any comments are welcome!
interesting, what other analogies can you draw between lists and queues?
ReplyDeletealso, you can do batch updates on list items by using SPWeb.ProcessBatchData
Thanks for comment.
Delete"SPWeb.ProcessBatchData" actually accepts "parameters" in batch, but in the background, the data is still processed piece by piece.
Many SharePoint "personalities" are anti-logical. Performance is one, permission management is another one. I plan to write a post regarding BCS, which is also awkward.
But, in the end, I still like it :-)
I just want to thank you for the work that you do and your willingness to share. You have created outstanding materials and we all benefit from your generosity.
ReplyDelete